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TITLE 

METHOD FOR CHROMOSOMAL ENGINEERING 
This application claims the benefit of U.S. Provisional Application 
No. 60/434,602 filed December 19, 2002. 
5 FIELD OF THE INVENTION 

This invention is in the field of microbiology. More specifically, this 
invention pertains to methods associated with in vivo chromosomal 
engineering. 

BACKGROUND OF THE INVENTION 

io The availability of complete bacterial genome sequences and the 

elucidation of metabolic pathways has resulted in the use of such 
knowledge to engineer microorganisms for the production of compounds 
of industrial interest. Microbial production of industrial compounds 
requires the ability to efficiently engineer changes to the genomes of the 

15 organisms. Engineering changes such as adding, removing, or modifying 
genetic elements has often proven to be challenging and time consuming 
exercises. One such modification is genetically engineering modulations 
to the expression of relevant genes in a metabolic pathway. 

There are a variety of ways to modulate gene expression. 

20 Microbial metabolic engineering generally involves the use of multi-copy 
vectors to express a gene of interest under the control of a strong or 
conditional promoter. This method of metabolic engineering for industrial 
use has several drawbacks. It is sometimes difficult to maintain the 
vectors due to segregational instability. Deleterious effects on cell viability 

25 and growth are often observed due to the vector burden. It is also difficult 
to control the optimal expression level of desired genes on a vector. To 
avoid the undesirable effects of using a multi-copy vector, a chromosomal 
integration approach using homologous recombination via a single 
insertion of bacteriophage X, transposons, or other suitable vectors 

30 containing the gene of interest has been used. However, this method also 
has drawbacks such as the need for multiple cloning steps in order to get 
the gene of interest into a suitable vector prior to recombination. Another 
drawback is the instability associated with the inserted genes, which can 
be lost due to excision. Lastly, this method has a limitation associated 

35 with multiple insertions and the inability to control the location of the 
insertion site on a chromosome. 

Recently, the use of the X-Red recombination system for PCR- 
mediated inactivation of chromosomal genes has been described (Murphy 
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et al., Gene, 246:321-330 (2000); Murphy, K., J. Bacterid., 
180:2063-2071 (1998); Poteete and Fenton, J. Bacteriol., 182:2336-2340 
(2000); Poteete, A., FEMS Microbiology Lett, 201:9-14 (2001); Datsenko 
and Wanner, PNAS, 97:6640-6645 (2000); Yu et al., PNAS, 
5 97:5978-5983 (2000); and Chaveroche et al., Nucleic Acids Research, 
28:e97:1-6 (2000)). 

Murphy et al. (supra, 1998) illustrate the use of the X-Red system 
and how it promotes recombination between bacterial chromosomes and 
a linear double-stranded (ds) DNA molecule introduced via 

10 electroporation. Several strains of bacteria, including Salmonella and 
enteropathogenic and enterohemorrhagic E. coli were successfully 
transformed using the system (Poteete, A., supra, 2001). However, the 
method of Murphy et al. (supra, 1998) used several hundred bases of 
homologous overlap in order to maximize the recombination efficiency. 

15 Stewart et al. (US 6,355,412) teach the use bacterial 

recombinases, namely the RecET and/or the X-Red system, for cloning 
and subcloning a target DNA fragment into a vector. The target DNA 
contains termini that share homology to corresponding homology arms on 
the vector. The target DNA is added to the vector via recombinase 

20 mediated homologous recombination. The methods described in 

US 6,355,412 are limited to adding target DNA to a vector. US 6,355,412 
does not teach a two-fragment method of chromosomal engineering nor 
does it teach a method useful for chromosomally engineering promoter 
and regulatory regions to artificially control the expression level of desired 

25 genes and/or operons in a biosynthetic pathway. A similar approach by 
Chaveroche et al. (supra) uses the A,-Red system for modification of a 
cosmid later used to transform via homologous recombination the genome 
of a filamentous fungi, Asperillus nidulas. Both examples illustrate the use 
of the A,-Red (or a functionally equivalent) system for genetic modification 

30 of a vector later used for homologous recombination with the host's 
genome or for epichromosomal transformation. 

Stewart et al. (US 6,509,156) teach the use of the RecET and/or X- 
Red system gene products for catalyzing homologous recombination 
between two DNA molecules (two regions of homology or "homology 

35 arms"). The US 6,509,156 does not teach a method based on triple 
homologous recombination (recombination between two linear DNA 
molecules and the bacterial chromosome) nor does it teach a method 
suitable for chromosomally engineering biosynthetic pathways in host 



bacteria. The methods described in both Stewart et al. patents would 
require one or more additional cloning steps prior to homologous 
recombination in order to integrate an expressible DNA fragment 
simultaneously with a selectable marker into the host ceirs chromosome. 
5 Zhang et al. (Nat Genet, 20:123-128, (1998)) teach the use of the 

RecET recombinase system in sbcA positive E. coli strains, illustrating the 
ability to stimulate homologous recombination for cloning and subcloning 
a target DNA fragment into a vector. Zhang et al. do not provide data on 
the A,-Red system nor do they teach a method of chromosomal 

10 engineering using triple homologous recombination (two DNA fragment 
method) that make it possible to integrate a promoter or gene along with a 
selectable marker in a single step without the need of a prior cloning step. 

Murphy et al. (supra, 2000) teach the use of the A,-Red system in 
E. coli and illustrate the utility of the system in creating gene knockouts 

15 without the need for a prior cloning step. The A,-Red system was 

compared to recombination proficient strains of E. coli and was found to 
be much more efficient, increasing recombination rates approximately 10 
to 100-fold. However, the described method required the use of 
approximately 1000 bp of homology on each end of a single linear dsDNA 

20 fragment for efficient transformation. Additionally, the intent of the 

described method was for open reading frame (ORF) functional analysis 
by gene knockout, resulting in the identification of unknown essential 
genes for drug development, but not for chromosomal integration of 
promoter(s) and/or gene(s). 

25 Pati et al. (US 6,074,853) teach the use of targeted homologous 

recombination for altering genetic sequences in vivo. Their complex multi- 
step method relies on the use of two single-stranded polynucleotides 
coated in vitro with purified E. coli RecA protein. The preferred target for 
recombination in Pati et al. is an epichromosomal sequence, such as a 

30 plasmid or other vector, with the RecA/single-stranded DNA mediated 
reaction occurring in vitro. The modified vector is then treated to remove 
the RecA protein prior to introduction into the targeted host cell. The 
present method is much less complicated and time consuming and targets 
the host ceirs genome directly. 

35 The problem to be solved therefore is to provide methods and 

materials useful for a facile, one-step method of in vivo chromosomal 
engineering of promoters and genes in bacteria, such as E. coli. A simple 
one-step mechanism does not exist which allows for direct modification of 



chromosomal genes, promoters, and other nucleic acid regulatory 
elements using homologous recombination. 

The present invention provides an easy, one-step, efficient method 
(lacking a cloning step used in previous methods) to stably integrate any 
5 promoter/regulatory region in front of any target gene in the E. coli 
chromosome so that the expression of such gene can be controlled. 
Additionally, the present method also permits repetitive cycles of 
chromosomal modifications for the creation and/or manipulation of 
biochemical pathways within a host cell using targeted addition and/or 

10 deletion of functional DNA molecules via the use triple homologous 

recombination between two linear, PCR-generated, DNA fragments and 
the host cell's chromosome. 

SUMMARY OF THE INVENTION 
The utility of the present chromosomal integration method in 

15 engineering bacterial biosynthetic pathways is illustrated using 

isoprenoid/carotenoid biosynthesis as an example. The promoters of the 
key genes that encode for rate-limiting enzymes involved in the isoprenoid 
pathway are engineered via the present method. The genetic 
modifications resulted in increased p-carotene production. Genes 

20 encoding enzymes involved in carotenoid biosynthesis, not normally found 
in E. coli, are also added via the present method to specific locations on 
the host's chromosome under the control of a strong promoter. The 
method allows for multiple chromosomal modifications within Escherichia 
coli, which is essential when engineering biosynthetic pathways for 

25 industrial purposes. Thus, the present method can be used for multiple 
chromosomal modifications, useful in engineering biosynthetic pathways. 
The method of homologous recombination using two linear PCR- 
generated fragments in the presence of the phage X-Red recombinase 
system is described in Figure 1. The method also includes the use of a 

30 site-specific recombinase for removal of selectable markers, allowing easy 
and efficient in vivo chromosomal engineering. 

Accordingly the invention provides a method for the directed 
integration of an expressible DNA fragment lacking a selectable marker 
into a bacterial chromosome comprising: 

35 a) providing at least one first recombination element having 

the general structure in the 5' to 3' direction: 
5'-RR1-RS-SM-RS-RR2-3'; wherein 
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(i) RR1 is a first recombination region of about 10 to 
50 bases; 

(ii) RS is a recombination site responsive to a site- 
specific recombinase; 

(iii) SM is a DNA fragment encoding a selectable 
marker; and 

(iv) RR2 is a second recombination region of about 10 
to 50 bases; 

b) providing at least one second recombination element 
having the general structure in a 5' to 3' direction: 

X-RR3; wherein 

(i) X is a DNA is an expressible DNA fragment having 
homology to the second recombination region; and 

(ii) RR3 is a third recombination of about 10-50 bases; 

c) providing a recombination proficient bacterial host 
harboring a A,-Red recombinase system, having a bacterial 
chromosome comprising: 

(i) a first chromosomal region having homology to 
said first recombination region; 

(ii) a second chromosomal region having homology to 
said third recombination region; 

d) transforming said recombination proficient host with the 
first and second recombination elements, wherein both 
elements are integrated into the bacterial chromosome 
between the first and second chromosomal regions forming 
a construct having the general structure in the 5' to 3' 
direction; 

5'-RR1 -RS-SM-RS-RR2-X-RR3; 

e) selecting and isolating transformed hosts having the 
construct of (d) on the basis of the selectable marker; 

f) expressing a site-specific recombinase in the isolated 
hosts of (e) wherein the selectable marker is excised from 
the chromosome and whereby the expressible DNA 
fragment is inserted into the bacterial chromosome, lacking 
the selectable marker. 

In similar fashion the invention provides a method for the directed 
integration of an expressible DNA fragment lacking a selectable marker 
into a bacterial chromosome comprising: 

5 ^ 



a) providing at least one first recombination element having 
the general structure in the 5' to 3' direction: 

5'-RR1 -RS-SM-RS-Y-RR2-3'; wherein 

(i) RR1 is a first recombination region of about 10 to 
50 bases; 

(ii) RS is a recombination site responsive to a site- 
specific recombinase; 

(iii) SM is a DNA fragment encoding a selectable 
marker; 

(iv) Y is a first expressible DNA fragment; and 

(v) RR2 is a second recombination region of about 10 
to 50 bases; 

b) providing at least one second recombination element 
having the general structure in a 5' to 3' direction: 

5'-X-RR3-3'; wherein 

(i) X is a DNA is a second expressible DNA fragment 
having homology to the second recombination region; 
and 

(ii) RR3 is a third recombination of about 10-50 bases; 

c) providing a recombination proficient bacterial host 
harboring a A,-Red recombinase system, and having a 
bacterial chromosome comprising: 

(i) a first chromosomal region having homology to 
said first recombination region; 

(ii) a second chromosomal region having homology to 
said third recombination region; 

d) transforming said recombination proficient host with the 
first and second recombination elements, wherein both 
elements are integrated into the bacterial chromosome 
between the first and second chromosomal regions forming 
a construct having the general structure in the 5' to 3' 
direction; 

5'-RR1 -RS-SM-RS-Y-RR2-X-RR3; 

e) selecting and isolating transformed hosts having the 
construct of (d) on the basis of the selectable marker; 

f) expressing a site-specific recombinase in the isolated 
hosts of (e) wherein the selectable marker is excised from 
the chromosome and whereby the first and second 

6 



expressible DNA fragments are inserted into the bacterial 
chromosome, lacking the selectable marker. 
In an alternate embodiment the invention provides a method for the 
integration of a foreign promoter in place of a bacterial chromosomal 
promoter in a recombination proficient host cell comprising: 

a) providing at least one first recombination element having 
the general structure in the 5' to 3' direction: 1 

5'-RR1-RS-SM-RS-RR2-3'; wherein 

(i) RR1 is a first recombination region of about 10 to 
50 bases; 

(ii) RS is a recombination site responsive to a site- 
specific recombinase; 

(iii) SM is a DNA fragment encoding a selectable 
marker; and 

(iv) RR2 is a second recombination region of about 10 
to 50 bases; 

b) providing at least one second recombination element 
having the general structure in a 5' to 3' direction: 

5'-FP-RR3-3'; wherein 

(i) FP is a promoter foreign to the recombination 
proficient host cell having homology to the second 
recombination region; and 

(ii) RR3 is a third recombination of about 10-50 bases; 

c) providing a recombination proficient bacterial host 
harboring a A,-Red recombinase system, having a bacterial 
chromosome comprising: 

(i) a first chromosomal region upstream of a bacterial 
promoter having homology to said first recombination 
region; 

(ii) a second chromosomal region, downstream of 
said bacterial promoter having homology to said third 
recombination region; 

d) transforming said recombination proficient host with the 
first and second recombination elements, wherein both 
elements are integrated into the bacterial chromosome 
between the first and second chromosomal regions forming 
a construct having the general structure in the 5' to 3' 
direction; 
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5'-RR1-RS-SM-RS-RR2-FP-RR3; 

e) selecting and isolating transformed hosts having the 
construct of (d) on the basis of the selectable marker; 

f) expressing a site-specific recombinase in the isolated 
hosts of (e) wherein the selectable marker is excised from 
the chromosome and whereby the foreign promoter is 
inserted into the bacterial chromosome in place of the 
bacterial promoter. 

In another embodiment the invention provides a method for the 
integration of an unlinked foreign promoter and foreign open reading 
frame into a bacterial chromosome in a recombination proficient host cell 
comprising: 

a) providing at least one first recombination element having 
the general structure in the 5' to 3' direction: 

5'-RR1-RS-SM-RS-FP-RR2-3'; wherein 

(i) RR1 is a first recombination region of about 10 to 
50 bases; 

(ii) RS is a recombination site responsive to a site- 
specific recombinase; 

(iii) SM is a DNA fragment encoding a selectable 
marker; 

(iv) FP is a promoter foreign to the recombination 
proficient host cell; and 

(iv) RR2 is a second recombination region of about 10 
to 50 bases; 

b) providing at least one second recombination element 
having the general structure in a 5' to 3' direction: 

5'-FO-RR3-3'; wherein 

(i) FO is an open reading frame foreign to the 
recombination proficient host cell having homology to 
the second recombination region; and 

(ii) RR3 is a third recombination of about 10-50 bases; 

c) providing a recombination proficient bacterial host 
harboring a A,-Red recombinase system, having a bacterial 
chromosome comprising: 

(i) a first chromosomal region upstream of a bacterial 
intra-operon chromosomal integration site having 
homology to said first recombination region; 
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(ii) a second chromosomal region, downstream of 
said bacterial intra-operon chromosomal integration 
site having homology to said third recombination 
region; 

d) transforming said recombination proficient host with the 
first and second recombination elements, wherein both 
elements are integrated into the bacterial chromosome 
between the first and second chromosomal regions forming 
a construct having the general structure in the 5' to 3' 
direction; 

5'-RR1-RS-SM-RS-FP-RR2-FO-RR3; 

e) selecting and isolating transformed hosts having the 
construct of (d) on the basis of the selectable marker; 

f) expressing a site-specific recombinase in the isolated 
hosts of (e) wherein the selectable marker is excised from 
the chromosome and whereby the foreign promoter and 
foreign open reading frame are inserted into the bacterial 
chromosome. 

In another preferred embodiment the invention provides a method 
for the integration of a foreign gene comprising a regulatory region and 
foreign open reading frame into a bacterial chromosome in a 
recombination proficient host cell comprising: 

a) providing at least one first recombination element having 
the general structure in the 5' to 3' direction: 

5'-RR1-RS-SM-RS-FG-RR2-3'; wherein 

(i) RR1 is a first recombination region of about 10 to 
50 bases; 

(ii) RS is a recombination site responsive to a site- 
specific recombinase; 

(iii) SM is a DNA fragment encoding a selectable 
marker; 

(iv) FG is a gene comprising a regulatory region, 
foreign to the recombination proficient host cell; and 
(iv) RR2 is a second recombination region of about 10 
to 50 bases; 

b) providing at least one second recombination element 
having the general structure in a 5' to 3' direction: 

5'-FO-RR3-3'; wherein 
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(i) FO is an open reading frame foreign to the 
recombination proficient host cell having homology to 
the second recombination region; and 

(ii) RR3 is a third recombination of about 10-50 bases; 

c) providing a recombination proficient bacterial host 
harboring a X-Red recombinase system, having a bacterial 
chromosome comprising: 

(i) a first chromosomal region upstream of a bacterial 
intra-operon chromosomal integration site having 
homology to said first recombination region; 

(ii) a second chromosomal region, downstream of 
said bacterial intra-operon chromosomal integration 
site having homology to said third recombination 
region; 

d) transforming said recombination proficient host with the 
first and second recombination elements, wherein both 
elements are integrated into the bacterial chromosome 
between the first and second chromosomal regions forming 
a construct having the general structure in the 5' to 3' 
direction; 

5'-RR1 -RS-SM-RS-FG-RR2-FO-RR3; 

e) selecting and isolating transformed hosts having the 
construct of (d) on the basis of the selectable marker; 

f) expressing a site-specific recombinase in the isolated 
hosts of (e) wherein the selectable marker is excised from 
the chromosome and whereby the foreign promoter and 
foreign open reading frame are inserted into the bacterial 
chromosome. 

BRIEF DESCRIPTION OF THE DRAWINGS, 
SEQUENCE DESCRIPTIONS AND BIOLOGICAL DEPOSITS 
Figure 1 illustrates the general method used in the present 

invention for a variety of in vivo bacterial chromosomal modifications. 
Figure 2 illustrates the method of chromosomal promoter 

engineering of the dxs gene using kanamycin selectable marker and 

phage 75 promoter (P7-5). 

Figure 3 illustrates the isoprenoid/carotenoid biosynthetic pathway. 
Figure 4 illustrates the A,-Red helper plasmid, pKD46 (Datsenko 

and Wanner, supra; GenBank® Accession Number AY048746). 
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Figure 5 illustrates the pPCB15 reporter plasmid construct 
containing several genes involved in carotenoid biosynthesis. 

Figure 6 illustrates the engineered chromosomal promoter 
construct and the PCR fragment analysis illustrating the kan-P T5 -dxs and 
5 Pjs-dxs constructs. 

Figure 7 illustrates the effects on p-carotene production by the 
introduction of a phage 75 promoter (P7-5) upstream of several isoprenoid 
genes. 

Figure 8 illustrates the method for introduction of the P T5 promoter 
10 upstream of the idi gene and the removal of the antibiotic selection 
marker. 

Figure 9 illustrates the PCR fragment analysis illustrating the /can- - 
Pjs-idi and Pjs-idi constructs. 

Figure 10 is a plasmid map of pSUH5 plasmid used for the 
15 preparation of the fused antibiotic marker- P T5 promoterf/can-P^) PCR- 
generated DNA fragment. 

Figure 1 1 illustrates the method for the introduction of the fused 
antibiotic marker kan-P T5 promoter and the Methylomonas sp. 16a dxs 
gene into the E. coli chromosome and the removal of the antibiotic 
20 selection marker. 

Figure 12 illustrates the PCR fragment analysis illustrating the /can- 
P T5 -dxs(16a) and P T5 -dxs(16a) constructs. 

Figure 13 illustrates the method for the introduction of the fused 
antibiotic marker kan-P T5 promoter and P. stewartii crtEIB genes into the 
25 E. coli chromosome. 

Figure 14 illustrates the PCR fragment analysis illustrating the /can- 
P T5 -crtEIB and P T5 -crtEIB constructs. 

The invention can be more fully understood from the following 
detailed description and the accompanying sequence descriptions, which 
30 form a part of this application. 

The following sequences comply with 37 C.F.R. 1 .821-1 .825 
("Requirements for Patent Applications Containing Nucleotide Sequences 
and/or Amino Acid Sequence Disclosures - the Sequence Rules") and are 
consistent with World Intellectual Property Organization (WIPO) Standard 
35 ST.25 (1998) and the sequence listing requirements of the EPO and PCT 
(Rules 5.2 and 49.5(a-bis), and Section 208 and Annex C of the 
Administrative Instructions). The symbols and format used for nucleotide 
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and amino acid sequence data comply with the rules set forth in 
37 C.F.R. §1.822. 

SEQ ID NO:1 is the first of two primer sequences used to PCR 
amplify the linear DNA fragment containing a kanamycin resistance gene 
5 from plasmid pKD4 (Datsenko and Wanner., supra.] GenBank® 

Accession Number AY048743) designated as "T1(dxs)" and contains a 
homology arm designated as "hi" chosen to match sequences in the 
upstream region of the ispA stop codon (Figure 2). 

SEQ ID NO:2 is the second of two primer sequences used to PCR 

io amplify the linear DNA fragment containing a kanamycin resistance gene 
from plasmid pKD4 designated as "B1(T5)" and contains a homology arm 
designated as "h2" chosen to match sequences in the 5'-end region of the 
phage 75 promoter (P75) DNA fragment (Figure 2). 

SEQ ID NO:3 is the first of two primer sequence used to PCR 

15 amplify the linear DNA fragment containing a P T5 promoter comprising the 
-10 and -35 consensus sequences, lac operator (/acO), and ribosomal 
binding site (rbs) from plasmid pQE30 (Qiagen Inc., Valencia, CA) and is 
designated as "T2(T5)" (Figure 2). The primer is later used to verify by 
PCR amplification the chromosomal construct of the integration of the 

20 kanamycin selectable marker and the P T5 promoter upstream of the dxs 
gene (Figure 6). 

SEQ ID NO:4 is the second of two primer sequences used to PCR 
amplify the linear DNA fragment containing a P T5 promoter comprising the 
-10 and -35 consensus sequences, lac operator (/acO), and ribosomal 
25 binding site (rbs) from pQE30 (Qiagen, Inc.) and is designated as 

"B2(dxs)" and contains a homology arm designated as "h3" chosen to 
match sequences in the downstream region of the dxs start codon 
(Figure 2). 

SEQ ID NO:5 is a primer used to verify by PCR amplification the 
30 chromosomal construct of the integration of the kanamycin selectable 

marker and the P T5 promoter upstream of the dxs gene and is designated 
as"T1" (Figure 6). 

SEQ ID NO:6 is a primer used to verify by PCR amplification the 
chromosomal construct of the integration of the kanamycin selectable 
35 marker and the P T5 promoter upstream of the dxs gene and is designated 
as "T2" (Figure 6). 

SEQ ID NO:7 is a primer used to verify by PCR amplification the 
chromosomal construct of the integration of the kanamycin selectable 
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marker and the P T5 promoter upstream of the dxs gene and is designated 
as "T3" (Figure 6). 

SEQ ID NO:8 is a primer used to verify by PCR amplification the 
chromosomal construct of the integration of the kanamycin selectable 
5 marker and the P T5 promoter upstream of the dxs gene and is designated 
as "T4" (Figure 6). 

SEQ ID NO:9 is the first of a primer pair used to PCR amplify a 
DNA fragment containing a kanamycin resistance selectable marker from 
plasmid pKD4 and is designated as 'T(idi)" and contains a homology arm 
10 designated as "h4" chosen to match sequences in the upstream region of 
the ygfU stop codon (Figure 8). 

SEQ ID NO: 10 is the second of a primer pair used to PCR amplify 
a DNA fragment containing a P T5 promoter comprising the -10 and -35 
consensus sequences, lac operator (/acO), and ribosomal binding site 
15 (rbs) from pQE30 (Qiagen, Inc.) and is designated as "B(idi)" and contains 
a homology arm designated as "h5" chosen to match sequences in the 
downstream region of the idi start codon (Figure 8). 

SEQ ID NO:1 1 is a primer used for PCR analysis confirming 
chromosomal integration of the kanamycin selectable marker and the P T5 
20 promoter upstream of the idi gene and is designated as "T5" (Figure 9). 

SEQ ID NO: 12 is a primer used for PCR analysis confirming 
chromosomal integration of the kanamycin selectable marker and the Pj 5 
promoter upstream of the idi gene and is designated as "T6" (Figure 9). 

SEQ ID NO: 1 3 is the first of a primer pair used to PCR amplify a 
25 DNA fragment containing the P T5 promoter element comprising the -1 0 
and -35 consensus sequences, /acO, and ribosomal binding site (rbs) 
from pQE30 (Qiagen, Inc.) and is designated as "Tt5". 

SEQ ID NO: 14 is the second of a primer pair used to PCR amplify 
a DNA fragment containing the Pj 5 promoter element comprising the -10 
30 and -35 consensus sequences, /acO, and ribosomal binding site (rbs) 
from pQE30 (Qiagen, Inc.) and is designated as "Bt5". 

SEQ ID NO: 15 is the nucleotide sequence of the ORF encoding the 
D-1-deoxyxylulose-5-phosphate synthase (DXS) enzyme from 
Methylomonas 16a which catalyzes the condensation of pyruvate and D- 
35 glyceraldehyde 3-phosphate to D-1-deoxyxylulose 5-phosphate. 

SEQ ID NO: 16 is the first of a primer pair used to PCR amplify a 
DNA fragment containing a fused kanamycin selectable marker- P T5 
promoter from pSUH5 and is designated as l T1(dxs16a)" and contains a 



homology arm designated as "h6" chosen to match sequence in the region 
between genes located at 30.9 min of the E. coli chromosome (Figure 11). 

SEQ ID NO: 17 is the second of a primer pair used to PCR amplify 
a DNA fragment containing a fused kanamycin selectable marker- P T5 
5 promoter from pSUH5 and is designated as "B1(dxs16a) M and contains a 
homology arm designated as "h7" chosen to match sequence in the 
downstream region of the dxs(16a) start codon (Figure 11). 

SEQ ID NO: 18 is the first of a primer pair used to PCR amplify a 
DNA fragment containing Methylomonas dxs(16a) gene from 
io Methylomonas 16a genomic DNA and is designated as "T2(dxs16a)" and 
contains a homology arm designated as "h8" chosen to match sequences 
in the 3'-end region of the fused kanamycin selectable marker- P T5 
promoter (Figure 11). 

SEQ ID NO: 19 is the second of a primer pair used to PCR amplify 
15 a DNA fragment containing Methylomonas dxs(16a) gene from 

Methylomonas 16a genomic DNA and is designated as "B2(dxs16a)" and 
contains a homology arm designated as "h9" chosen to match sequences 
in the region between genes located at 30.9 minutes of the E. coli 
chromosome (Figure 11). 
20 SEQ ID NO:20 is a primer used for PCR analysis confirming 

chromosomal integration of the fused kan-Pj 5 promoter and dxs(16a) 
gene and is designated as "T7" (Figure 12). 

SEQ ID NO:21 is a primer used for PCR analysis confirming 
chromosomal integration of the fused kan-P T5 promoter and dxs(16a) 
25 gene and is designated as "T8" (Figure 12). 

SEQ ID NO:22 is a primer used for PCR analysis confirming 
chromosomal integration of the fused kan-P T5 promoter and dxs(16a) 
gene and is designated as "T9" (Figure 12). 

SEQ ID NO:23 is the first of a primer pair used to PCR amplify a 
30 DNA fragment containing a fused kanamycin selectable marker- P T5 
promoter from pSUH5 and is designated as "T1(crtE)" and contains a 
homology arm designated as "h10" chosen to match sequence in the 
inter-operon region located at 81.2 minutes of the E. coli chromosome 
(Figure 13). 

35 SEQ ID NO:24 is the second of a primer pair used to PCR amplify 

a DNA fragment containing a fused kanamycin selectable marker- P T5 
promoter from pSUH5 and is designated as "B1(crtE)" and contains a 
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homology arm designated as "h1T chosen to match sequence in the 
downstream region of the crtE start codon (Figure 13). 

SEQ ID NO:25 is the first of a primer pair used to PCR amplify a 
DNA fragment containing P. stewartii crtE gene from pPCB15 (Figure 5) 
5 and is designated at "T2(crtE)" and contains a homology arm designated 
as "h8" chosen to match sequences in the 3'-end region of the fused 
kanamycin selectable marker- P T5 promoter (Figure 13). 

SEQ ID NO:26 is the second of a primer pair used to PCR amplify 
a DNA fragment containing P. stewartii crtE gene from pPCB15 and is 
10 designated at "B2(crtE)" and contains a homology arm designated "h12" 
chosen to match sequences in the inter-operon region located at 81.2 
minutes on the E. coli chromosome (Figure 13). 

SEQ ID NO:27 is a primer used for PCR amplify a DNA fragment 
containing a priming sequence (26 bp) corresponding to the 162 bases 
15 upstream region of the integration site of the fused kanamycin marker- 
P T5 promoter-P stewartii crtE gene from E. coli P T5 -crtE and is 
designated as "T10" (Figure 13). 

SEQ ID NO:28 is the second of a primer pair used to amplify the 
fused kanamycin marker- P T5 promoter-P. stewartii crtE gene from E. coli 
20 P T5 -crtE and is designated as "B1(crtlB)" and contains a homology arm 
designated as "h13" chosen to match sequences in the downstream 
region of the crtl start codon (Figure 1 3). 

SEQ ID NO:29 is the first of a primer pair used to PCR amplify the 
P. stewartii crtIB linked gene pair from the plasmid pPCB15 and is 
25 designated as "T2(crtlB)" and contains a homology arm designated as 
"h14" chosen to match sequences in the 3'-end region of the fused 
kanamycin selectable marker- P T5 promoter-P. stewartii crtE DNA 
fragment (Figure 13). 

SEQ ID NO:30 is the second of a primer pair used to PCR amplify 
30 the P. stewartii crtIB linked gene pair from the plasmid pPCB15 and is 
designated as "B2(crtlB)" and contains a homology arm designated "h12" 
(same arm used for "B2(crtE)" primer) chosen to match sequences in the 
inter-operon region located at 81.2 min on the E. coli chromosome 
(Figure 13). 

35 SEQ ID NO:31 is a primer used to verify by PCR amplification the 

chromosomal construct of the integration of the fused kanamycin 
selectable marker- P T5 promoter-P. stewartii crtE and the P. stewartii crtIB 
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operon in the inter-operon region located at 81.2 min on the E. coli 
chromosome and is designated as "T1 1" (Figure 14). 

SEQ ID NO:32 is a primer used to verify by PCR amplification the 
chromosomal construct of the integration of the fused kanamycin 
5 selectable marker- Pj 5 promoter-P. stewartii crtE and the P. stewartii crtIB 
operon in the inter-operon region located at 81.2 min on the E. coli 
chromosome and is designated as "T12" (Figure 14). 

SEQ ID NO:33 is a primer used to verify by PCR amplification the 
chromosomal construct of the integration of the fused kanamycin 
10 selectable marker- P T5 promoter-P. stewartii crtE and the P. stewartii crtIB 
operon in the inter-operon region located at 81.2 min on the E. coli 
chromosome and is designated as "T13" (Figure 14). 

SEQ ID NO:34 is the nucleotide sequence for plasmid pKD4 
(Datsenko and Wanner, supra) having GenBank® Accession number 
15 AY048743 and was used as a PCR template to amplify the DNA fragment 
containing a kanamycin resistance gene flanked by FR T site-specific 
recombinase recognition sequences. 

SEQ ID NO:35 is the nucleotide sequence for plasmid pKD46 
(Datsenko and Wanner, supra) having GenBank® Accession number 
20 AY048746. Plasmid pKD46 expresses the components of the X-Red 
Recombinase system. 

SEQ ID NO:36 is the nucleotide sequence for plasmid pSUH5 
which was used as a template for PCR amplifying the fused kanamycin 
resistance gene- P T5 promoter. 
25 SEQ ID NO:37 is the nucleotide sequence for plasmid pPCB15 

which was used as a reporter plasmid construct containing several genes 
involved in carotenoid biosynthesis, and as a template for PCR amplifying 
the crtE, and crtIB genes. 

Applicants have made the following biological deposit under the 
30 terms of the Budapest Treaty on the International Recognition of the 
Deposit of Microorganisms for the purposes of Patent Procedure: 

Depositor Identification Infl. Depository 

Reference Designation Date of Deposit 

Plasmid pCP20 ATCC PTA-4455 June 13, 2002 

Methylomonas 16a ATCC PTA-2402 August 22, 2000 
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As used herein, "ATCC" refers to the American Type Culture 
Collection International Depository Authority located at ATCC, 10801 
University Blvd., Manassas, VA 20110-2209, USA. The "International 
Depository Designation" is the accession number to the culture on deposit 
5 with ATCC. 

The listed deposits will be maintained in the indicated international 
depository for at least thirty (30) years and will be made available to the 
public upon the grant of a patent disclosing it. The availability of a deposit 
does not constitute a license to practice the subject invention in 

10 derogation of patent rights granted by government action. 

DETAILED DESCRIPTION OF THE INVENTION 
The invention relates to a method for the directed integration of 
promoters and/or expressible DNA fragments into a bacterial chromosome 
in a single step. The method employs at least two linear double-stranded 

15 (ds) recombination elements or constructs that carry various expressible 
DNA elements such as promoters, foreign genes or open reading frames 
to be chromosomally integrated. Each recombination element comprises a 
recombination region having regions of homology to either the other 
recombination element or portions of the bacterial chromosome. A unique 

20 feature of the present method is the use of at least two linear double- 
stranded (ds) recombination elements to chromosomally integrate various 
expressible DNA elements such as a promoter or foreign gene or open 
reading frame on one recombination element in combination with a 
selectable marker on the other recombination element which is bounded 

25 by site-specific recombinase sites to allow for facile excision when the 

marker is no longer needed. Another feature of the present method is the 
use of the A,-Red recombinase system in a recombination proficient host. 
The A,-Red system increases the utility of the method by facilitating 
efficient recombination between linear recombination elements and the 

30 chromosome over relatively small regions of homology. The present 
method facilitates the replacement of native bacterial chromosomal 
promoters with foreign or artificial promoters. Promoter additions and/or 
replacements can be combined with chromosomal integration of foreign 
genes for tunable expression by placing the foreign genes under the 

35 control of a regulatory promoter. 

The present method can be used to perform additional chromosomal 
modifications other than promoter replacement, including the chromosomal 
integration of one or more foreign genes. Functional expression of these 
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modifications was measured by an increase in the production of p-carotene 
(orange color) or by the production of lycopene (pink color). 

In this disclosure, a number of terms and abbreviations are used. 
The following definitions are provided. 
5 "Open reading frame" is abbreviated ORF. 

"Polymerase chain reaction" is abbreviated PCR. 
The term "directed integration" or "targeted integration" means the 
integration of an expressible DNA fragment into a bacterial chromosome 
whereby the integration is effected by corresponding regions of homology 
10 on the a cassette comprising the expressible DNA and the chromosome. 
The term "expressible DNA fragment" means any DNA that 
influences phenotypic changes in the host cell. An "expressible DNA 
fragment" may include for example, DNA comprising regulatory elements, 
isolated promoters, open reading frames, genes, or combinations thereof. 
15 The term "recombination element" refers to a linear nucleic acid 

construct useful for the transformation of a recombination proficient 
bacterial host. Recombination elements of the invention may include a 
variety of genetic elements such as selectable markers, expressible DNA 
fragments, and recombination regions having homology to regions on a 
20 bacterial chromosome or on other recombination elements. 

The terms "recombination region" and "homology arm" are used 
interchangeably and refer to a nucleotide sequence that enables 
homologous recombination between two nucleic acids having substantially 
the same nucleotide sequence in a particular region of two different 
25 nucleic acids. The preferred size range of the nucleotide sequence of the 
homology arm is from about 10 to about 50 nucleotides. 

The term "recombinase" refers to one or more enzymes, which 
either work alone or in combination to stimulate homologous 
recombination. The "A,-Red recombinase", "?L-Red recombination 
30 system", and "A,-Red system" are used interchangeably to describe a 
group of enzymes encoded by the bacteriophage X genes exo, bet, and 
gam. The enzymes encoded by the three genes work together to increase 
the rate of homologous recombination in E. co//, an organism generally 
considered to have a relatively low rate of homologous recombination; 
35 especially when using linear recombination elements. 

The term "site-specific recombinase system" is used to describe a 
system comprised of one or more enzymes which recognize specific 
nucleotide sequences (recombination target sites) and which catalyze 
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recombination between the recombination target sites. Site-specific 
recombination provides a method to rearrange, delete, or introduce 
exogenous DNA. Examples of site-specific recombinases and their 
associated recombination target sites are flippase (FLP)/FR7, Cre//ox, 
5 Xer/d/f, and Int/atf, to name a few The present invention illustrates the 
use of a site-specific recombinase to remove selectable markers. 
Antibiotic resistance markers, flanked on both sides by FRT recombination 
target sites, are removed by expression of the FLP site-specific 
recombinase. The use of site-specific recombinases to remove selectable 
10 markers permits multiple chromosomal modifications necessary for 
microbial pathway engineering and is not limited to the number of 
available selection markers (Huang et a!., J. BacterioL, 179(19): 6076- 
6083 (1997)). 

The term "selectable marker" means a gene product that, when 
15 present, enables one to identify and preferentially propagate a particular 
cell type. 

The term "recombination proficient bacterial host" is used to 
describe a bacterial host that contains a functional recombination system 
and is capable of homologous recombination at rates useful for genetic 

20 engineering. 

The term "homology" as applied to recombination regions and 
corresponding regions on a bacterial chromosome means nucleotide 
sequences sharing identical or nearly identical sequences. 
Complementary sequences between regions on the bacterial chromosome 

25 and recombination regions can associate and undergo homologous 
recombination in the presence of a recombinase system. Preferred 
recombination regions, or homology arms, are those having identical 
sequences to the corresponding regions on the bacterial chromosome and 
are about 10 to about 50 bp in length. 

30 As used herein the term "upstream" when used in reference to a 

region of DNA means the 5' side of a particular gene or sequence of 
nucleotides. 

As used herein the term "downstream" when used in reference to a 
region of DNA means the 3' side of a particular gene or sequence of 
35 nucleotides. 

The term "inter-operon chromosomal integration site" refers to a 
chromosomal site where integration of exogenous DNA using the current 
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invention is targeted and where integration of the exogenous DNA will not 
disrupt the functionality of an endogenous operon or gene within the host. 

The term "regulatory circuit" refers to functionally integrated DNA 
which can interact with a substance (signaling molecule) to either increase 
5 or decrease transcription. For example, the material could be a repressor 
protein that binds to an operator site (specific DNA sequence located near 
or overlapping a promoter's RNA polymerase binding site). Binding of the 
repressor hinders or prevents transcription of the associated gene. 
Conversely, the substance may be an activator protein that binds to a 

10 specific DNA sequence which facilitates RNA polymerase binding, 
"activating" or increasing transcription of an associated gene. In the 
present invention, the term "positive regulatory site" refers to functionally 
integrated DNA that (either alone or in response to interaction with a 
substance) enhances transcription of an associated coding sequence. 

15 The term "negative regulatory site" refers to functionally integrated DNA 
that (either alone or in response to interaction with a substance) 
decreases or blocks transcription of an associated coding sequence. 

As used herein, an "isolated nucleic acid fragment" is a polymer of 
RNA or DNA that is single- or double-stranded, optionally containing 

20 synthetic, non-natural or altered nucleotide bases. An isolated nucleic 
acid fragment in the form of a polymer of DNA may be comprised of one 
or more segments of cDNA, genomic DNA or synthetic DNA. 

The term "isoprenoid" or "terpenoid" refers to the compounds and 
any molecules derived from the isoprenoid pathway including 10 carbon 

25 terpenoids and their derivatives, such as carotenoids and xanthophylls. 

The terms "Methylomonas 16a strain" and "Methylomonas 16a" 
(ATCC PTA-2402, deposited August 22, 2000) are used interchangeably 
and refer to a bacterium of a physiological group of bacteria known as 
methanotrophs, which are unique in their ability to utilize methane as a 

30 substrate. 

The term "methanotroph" means a prokaryote capable of utilizing 
methane as a substrate. Complete oxidation of methane to carbon 
dioxide occurs by aerobic degradation pathways. Typical examples of 
methanotrophs include, but are not limited to the genera Methylomonas, 
35 Methylobacter, Methylococcus, and Methylosinus. 

The term "dxs" refers to the enzyme D-1-deoxyxylulose 5- 
phosphate encoded by the E. coli dxs gene which catalyzes the 
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condensation of pyruvate and D-glyceraldehyde 3-phosphate to D-1- 
deoxyxylulose 5-phosphate. 

The term "dxs(16a)" refers to the enzyme D-1-deoxyxyIulose 5- 
phosphate encoded by the Methylomonas 16a strain dxs gene which 
5 catalyzes the condensation of pyruvate and D-glyceraldehyde 3- 
phosphate to D-1-deoxyxylulose 5-phosphate. 

The term u idt refers to the enzyme isopentenyl diphosphate 
isomerase encoded by the E. coli idi gene that converts isopentenyl 
diphosphate to dimethylallyl diphosphate. 
10 The term "triple homologous recombination" in the present 

invention refers to a genetic recombination between two linear DNA 
fragments and the target chromosome via their homologous sequences 
"resulting in chromosomal integration of the two linear nucleic acid 
fragments into the target chromosome. 
15 The term "pPCB15" refers to the plasmid containing p-carotene 

synthesis genes Pantoea crtEXYIB, used as a reporter plasmid for 
monitoring p-carotene production in E coli that is genetically engineered 
via the present method. 

The term "pSUH5" refers to the plasmid that was constructed by 
20 cloning a phage 75 promoter (P75) region into the A/del restriction 

endonuclease site of pKD4 (Datsenko and Wanner, supra). It was used 
as a template plasmid for PCR amplification of a fused kanamycin 
selectable marker- Pj 5 promoter linear DNA molecule. 

The terms "P T5 promoter" and "phage 75 promoter" refer to the 
25 nucleotide sequence that comprises the -10 and -35 consensus 

sequences from phage 75, lactose operator (/acO), and ribosomal binding 
site {rbs) from pQE30 (Qiagen, Inc.). 

The term "helper plasmid" refers to either pKD46 encoding A,-Red 
recombinase system or pCP20 encoding the FLP site-specific 
30 recombinase (Datsenko and Wanner, supra). 

The term "E coir when applied to Escherichia coli K-12 derivatives 
will mean the MG1655 (ATCC 47076) and MC1061 (ATCC 53338) strains 

The term "Pantoea stewartii subsp. stewartif is abbreviated as 
"Pantoea stewartif (ATCC 8199) and is used interchangeably with 
35 Erwinia stewartii (Mergaert et al., IntJ. Syst Bacterid., 43:162-173 
(1993)). 

The term "Pantoea crtEXYIB cluster" refers to a gene cluster 
containing carotenoid synthesis genes crtEXYIB amplified from Pantoea 
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stewartii ATCC 8199. The gene cluster contains the genes crtE, crtX, 
crtY, crtl, and crtB. The cluster also contains a crtZ gene organized in 
opposite direction adjacent to crtB gene. In the present invention, the 
gene cluster is located on the reporter plasmid pPCB15 and is used to 
5 monitor the effects of various chromosomal modifications on p-carotene 
production. 

The term "CrtE" refers to geranylgeranyl pyrophosphate synthase 
enzyme encoded by crtE gene which converts trans-trans-farnesyl 
diphosphate + isopentenyl diphosphate to pyrophosphate + 
10 geranylgeranyl diphosphate. 

The term "CrtY" refers to lycopene cyclase enzyme encoded by 
crtY gene which converts lycopene to beta-carotene. 

The term "Crtl" refers to phytoene dehydrogenase enzyme encoded 
by crtl gene which converts phytoene into lycopene via the intermediaries 
15 of phytofluene, zeta-carotene and neurosporene by the introduction of 4 
double bonds. 

The term "CrtB" refers to phytoene synthase enzyme encoded by 
crtB gene which catalyzes reaction from prephytoene diphosphate 
(geranylgeranyl pyrophosphate) to phytoene. 
20 The term "CrtX" refers to zeaxanthin glucosyl transferase enzyme 

encoded by crtX gene which converts zeaxanthin to zeaxanthin-p- 
diglucoside. 

The term "CrtZ" refers to the p-carotene hydroxylase enzyme 
encoded by crtZ gene which catalyses both the hydroxylation reaction 
25 from p-carotene to zeaxanthin or the hydroxylation of canthaxanthin to 
astaxanthin. 

The term "carotenoid biosynthetic enzyme" is an inclusive term 
referring to any and all of the enzymes encoded by the crtEXYIB gene 
cluster. The enzymes include CrtE, CrtY, Crtl, CrtB, and CrtX. 

30 "Codon degeneracy" refers to the nature in the genetic code 

permitting variation of the nucleotide sequence without affecting the amino 
acid sequence of an encoded polypeptide. The skilled artisan is well 
aware of the "codon-bias" exhibited by a specific host cell in usage of 
nucleotide codons to specify a given amino acid. Therefore, when 

35 synthesizing a gene for improved expression in a host cell, it is desirable 
to design the gene such that its frequency of codon usage approaches the 
frequency of preferred codon usage of the host cell. 
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"Synthetic genes" can be assembled from oligonucleotide building 
blocks that are chemically synthesized using procedures known to those 
skilled in the art. These building blocks are ligated and annealed to form 
gene segments which are then enzymatically assembled to construct the 
5 entire gene. "Chemically synthesized", as related to a sequence of DNA, 
means that the component nucleotides were assembled in vitro. Manual 
chemical synthesis of DNA may be accomplished using well-established 
procedures, or automated chemical synthesis can be performed using one 
of a number of commercially available machines. Accordingly, the genes 

10 can be tailored for optimal gene expression based on optimization of 

nucleotide sequence to reflect the codon bias of the host cell. The skilled 
artisan appreciates the likelihood of successful gene expression if codon 
usage is biased towards those codons favored by the host. Determination 
of preferred codons can be based on a survey of genes derived from the 

15 host cell where sequence information is available. 

"Gene" refers to a nucleic acid fragment that expresses a specific 
protein, including regulatory sequences preceding (5* non-coding 
sequences) and following (3' non-coding sequences) the coding 
sequence. "Native gene" refers to a gene as found in nature with its own 

20 regulatory sequences. "Chimeric gene" refers to any gene that is not a 
native gene, comprising regulatory and coding sequences that are not 
found together in nature. Accordingly, a chimeric gene may comprise 
regulatory sequences and coding sequences that are derived from 
different sources, or regulatory sequences and coding sequences derived 

25 from the same source, but arranged in a manner different than that found 
in nature. "Endogenous gene" refers to a native gene in its natural 
location in the genome of an organism. A "foreign gene" or "exogenous 
gene" refers to a gene not normally found in the host organism, but that is 
introduced into the host organism by gene transfer. Foreign genes can 

30 comprise native genes inserted into a non-native organism, or chimeric 
genes. A "transgene" is a gene that has been introduced into the genome 
by a transformation procedure. 

"Operon", in bacterial DNA, is a cluster of contiguous genes 
transcribed from one promoter that gives rise to a polycistronic mRNA. 

35 "Coding sequence" refers to a DNA sequence that codes for a 

specific amino acid sequence. "Suitable regulatory sequences" refer to 
nucleotide sequences located upstream (5' non-coding sequences), 
within, or downstream (3' non-coding sequences) of a coding sequence, 
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and which influence the transcription, RNA processing or stability, or 
translation of the associated coding sequence. Regulatory sequences 
may include promoters, translation leader sequences, introns, 
polyadenylation recognition sequences, RNA processing site, effector 
5 binding site and stem-loop structures. 

"Promoter" refers to a DNA sequence capable of controlling the 
expression of a coding sequence or functional RNA. In general, a coding 
sequence is located 3' to a promoter sequence. Promoters may be 
derived in their entirety from a native gene, or be composed of different 

10 elements derived from different promoters found in nature, or even 

comprise synthetic DNA segments. It is understood by those skilled in the 
art that different promoters may direct the expression of a gene in different 
tissues or cell types, or at different stages of development, or in response 
to different environmental or physiological conditions. Promoters which 

15 cause a gene to be expressed in most cell types at most times are 

commonly referred to as "constitutive promoters". It is further recognized 
that since in most cases the exact boundaries of regulatory sequences 
have not been completely defined, DNA fragments of different lengths 
may have identical promoter activity. As used herein a "foreign promoter" 

20 will refer to a promoter derived from a source other than the cell in which 
the promoter is being used. For the purposes of the present invention 
foreign promoters used in the present microbial systems may be derived 
from other microbes, such as bacteria, yeasts and fungi, as well as 
eukaryotic sources such as plants and mammalian cells. 

25 "RNA transcript" refers to the product resulting from RNA 

polymerase-catalyzed transcription of a DNA sequence. When the RNA 
transcript is a perfect complementary copy of the DNA sequence, it is 
referred to as the primary transcript or it may be an RNA sequence 
derived from post-transcriptional processing of the primary transcript and 

30 is referred to as the mature RNA. "Messenger RNA (mRNA)" refers to the 
RNA that is without introns and that can be translated into protein by the 
cell. "cDNA" refers to a double-stranded DNA that is complementary to 
and derived from mRNA. "Sense" RNA refers to RNA transcript that 
includes the mRNA and so can be translated into protein by the cell. 

35 "Antisense RNA" refers to an RNA transcript that is complementary to all 
or part of a target primary transcript or mRNA and that blocks the 
expression of a target gene (U.S. Patent No. 5,107,065; WO 9928508). 
The complementarity of an antisense RNA may be with any part of the 



specific gene transcript, i.e., at the 5' non-coding sequence, 3' non-coding 
sequence, or the coding sequence. "Functional RNA" refers to antisense 
RNA, ribozyme RNA, or other RNA that is not translated yet has an effect 
on cellular processes. 
5 The term "operably linked" refers to the association of nucleic acid 

sequences on a single nucleic acid fragment so that the function of one is 
affected by the other. For example, a promoter is operably linked with a 
coding sequence when it is capable of affecting the expression of that 
coding sequence (i.e., that the coding sequence is under the 

10 transcriptional control of the promoter). Coding sequences can be 

operably linked to regulatory sequences in sense or antisense orientation. 

The term "expression", as used herein, refers to the transcription 
and stable accumulation of sense (mRNA) or antisense RNA derived from 
the nucleic acid fragment of the invention. Expression may also refer to 

15 translation of mRNA into a polypeptide. 

"Transformation" refers to the transfer of a nucleic acid fragment 
into the genome of a host organism, resulting in genetically stable 
inheritance. Host organisms containing the transformed nucleic acid 
fragments are referred to as "transgenic" or "recombinant" or 

20 "transformed" organisms. 

The terms "plasmid", "vector" and "cassette" refer to an extra 
chromosomal element often carrying genes which are not part of the 
central metabolism of the cell, and usually in the form of circular double- 
stranded DNA fragments. Such elements may be autonomously 

25 replicating sequences, genome integrating sequences, phage or 

nucleotide sequences, linear or circular, of a single- or double-stranded 
DNA or RNA, derived from any source, in which a number of nucleotide 
sequences have been joined or recombined into a unique construction 
which is capable of introducing a promoter fragment and DNA sequence 

30 for a selected gene product along with appropriate 3' untranslated 

sequence into a cell. "Transformation cassette" refers to a specific vector 
containing a foreign gene and having elements in addition to the foreign 
gene that facilitates transformation of a particular host cell. "Expression 
cassette" refers to a specific vector containing a foreign gene and having 

35 elements in addition to the foreign gene that allow for enhanced 
expression of that gene in a foreign host. 

The invention relates to a method for the directed integration of 
expressible DNA into a bacterial chromosomal. The DNA is directed to the 



bacterial chromosome by homologous recombination between regions of 
homology on a cassette comprising the DNA and the chromosome in a 
recombination proficient bacterial host harboring the X-Red recombinase 
system. At lest two linear cassettes or recombination elements are 
5 employed. Typically one recombination element comprises a selectable 
marker bounded by site-specific recombinase sites. Both 5' and 3' to the 
recombinase sites are recombination regions ("homology arms") having 
homology to either portions of the bacterial chromosome or to another 
recombination element. Additionally the recombination element may also 

10 comprise expressible DNA such as a promoter or foreign gene or open 
reading frame. The second recombination element will typically contain 
many of the same elements as the first, with the exception of the 
selectable marker bounded by recombination target sites (recognized by a 
site-specific recombinase). 

15 A,-Red Recombinase System 

The A,-Red recombinase system used in the present invention is 
contained on a helper plasmid (pKD46) and is comprised of three 
essential genes, exo, bet, and gam (Figure 4)(Datsenko and Wanner, 
supra). The exo gene encodes an A,-exonuclease, which processively 

20 degrades the 5' end strand of double-stranded (ds) DNA and creates 3' 
single-stranded overhangs. Bet encodes for a protein which complexes 
with the X-exonuclease and binds to the single stranded DNA and 
promotes renaturation of complementary strands and is capable of 
mediating exchange reactions. Gam encodes for a protein that binds to 

25 the E. coli RecBCD complex, inhibiting endonuclease activity. 

The X-Red system is used in the present invention because 
homologous recombination in E. coli occurs at a very low frequency and 
usually requires extensive regions of homology. The A,-Red system 
facilitates the ability to use short regions of homology (10-50 bp) flanking 

30 linear dsDNA fragments for homologous recombination. Additionally, the 
RecBCD complex normally expressed in E. coli prevents the use of linear 
dsDNA for transformation as the complex's exonuclease activity efficiently 
degrades linear dsDNA. Inhibition of the RecBCD complex's 
endonuclease activity by gam is essential for efficient homologous 

35 recombination using linear dsDNA fragments. 

An important aspect of the present invention is the chromosomal 
promoter replacement in combination with a selection marker by using two 
PCR-generated, linear, double stranded DNA fragments. The invention 



involves the use of a triple homologous recombination event between two 
PCR-generated fragments and the E. coli chromosome (Figure 1). The 
present method permits directed chromosomal engineering via triple 
homologous recombination without the need for a separate cloning step. 
5 The scope of the invention permits directed integration of functional DNA 
molecules (promoters, genes, regulatory elements, operons, etc.). The 
present method has no limitation in terms of chromosomal integration site 
and number. Therefore, the method is suitable for redesigning 
biosynthetic pathways, optimizing metabolic flux, and creating novel 
10 pathways. 

Another aspect of the present method is the incorporation of 
selectable markers, such as antibiotic resistance, in one of the two linear 
double stranded DNA fragments. The selectable marker, such as one 
influencing antibiotic resistance, is used to select for those colonies that 

15 have undergone triple homologous recombination using the present 
invention. However, multiple chromosomal manipulations are needed 
when engineering bacterial biosynthetic pathways. In order to accomplish 
this, the method includes the use of site-specific recombination sequences 
flanking the selectable marker. Use of such site-specific recombinases to 

20 remove selectable markers is known in the art (Borges et al., 

WO 01/18222 A1; Bouhassira et aL, WO 00/63410; and Martinez- 
Morales, et al., J. Bacteriol., 181:7143-7148 (1999)). Site-specific 
recombinases, such as the use of flippase (FLP) recombinase in the 
present invention, recognize specific recombination sequences (i.e. FRT 

25 sequences) and allow for the excision of the selectable marker. This 
aspect of the invention enables the repetitive use of the present method 
for multiple chromosomal modifications. The invention is not limited to the 
FLP-FRT" recombinase system as several examples of site specific 
recombinases and their associated specific recognition sequences are 

30 know in the art. Examples of other suitable site-specific recombinases 
and their corresponding recognition sequences include, but are not limited 
to Cre//ox, Xer/d/7, and Int/atf. 

Another aspect of the invention relates to its use in operon 
assembly on the bacterial chromosome by sequential integration of genes. 

35 Prokaryotic operons can be created using the present invention. An 

example of this aspect is illustrated in Example 13 where three carotenoid 
biosynthesis genes are integrated into the bacterial chromosome under 
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the control of a single regulatory element (Figure 13) via triple homologous 
recombination. 

The present method allows for stable chromosomal integration of 
any promoter/regulatory region in front of any target gene in the E. coli 
5 chromosome so that the expression of such gene can be controlled. The 
method can also be used to engineer a variety of genetic elements, in 
addition to promoters, in the custom design of biosynthetic pathways. The 
approach is suitable for constructing industrially useful microbial strains, 
rather than just increased expression of a specific single gene. In terms of 

10 metabolic balance, productivity, control, stability, and optimal expression 
of the genes of a particular pathway, the approach has many advantages 
and benefits when compared to metabolic engineering based on a 
recombinant vector approach. The method is illustrated using E. coli as 
an example, but the method should prove to be useful in other bacterial 

is strains as well (Poteete, A., supra % 2001). Examples of other industrially 
useful strains, which could be genetically engineered using the present 
invention, include Salmonella, Methylomonas, Pseudomonas, Bacillus and 
Acinetobacter. 
Carotenoid Biosynthesis 

20 Using several examples, the utility of the present method is 

illustrated by engineering increases in p-carotene (a carotenoid) 
production in an E. coli reporting strain. Carotenoids are pigments that 
are ubiquitous throughout nature and synthesized by all oxygen evolving 
photosynthetic organisms, and in some heterotrophic growing bacteria 

25 and fungi. Industrial uses of carotenoids include pharmaceuticals, food 
supplements, electro-optic applications, animal feed additives, and 
colorants in cosmetics, to mention a few. 

Because animals are unable to synthesize carotenoids de novo, 
they must obtain them by dietary means. Thus, manipulation of 

30 carotenoid production and composition in plants or bacteria can provide 
new or improved source for carotenoids. 

Carotenoids come in many different forms and chemical structures. 
Most naturally occurring carotenoids are hydrophobic tetraterpenoids 
containing a C40 methyl-branched hydrocarbon backbone derived from 

35 successive condensation of eight C5 isoprene units (IPP). In addition, 

novel carotenoids with longer or shorter backbones occur in some species 
of non-photosynthetic bacteria. The term "carotenoid" actually includes 
both carotenes and xanthophylls. A "carotene" refers to a hydrocarbon 
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carotenoid. Carotene derivatives that contain one or more oxygen atoms, 
in the form of hydroxy-, methoxy-, oxo-, epoxy-, carboxy-, or aldehydic 
functional groups, or within glycosides, glycoside esters, or sulfates, are 
collectively known as "xanthophylls". Carotenoids are furthermore 
5 described as being acyclic, monocyclic, or bicyclic depending on whether 
the ends of the hydrocarbon backbones have been cyclized to yield 
aliphatic or cyclic ring structures (G. Armstrong, (1999) In Comprehensive 
Natural Products Chemistry , Elsevier Press, volume 2, pp 321-352). 
E. coli contains genes in the isoprenoid biosynthetic pathway 
10 (Figure 3). Isoprenoid biosynthesis starts with the condensation of 
pyruvate with glyceraldehyde-3-phosphate (G3P) to form deoxy-D- 
xylulose via the enzyme encoded by the dxs gene. A host of additional 
enzymes are then used in subsequent sequential reactions, converting 
deoxy-D-xylulose to the final C5 isoprene product, isopentenyl 
15 pyrophosphate (IPP). IPP is converted to the isomer dimethylallyl 

pyrophosphate (DMAPP) via the enzyme encoded by the idi gene. IPP is 
condensed with DMAPP to form C10 geranyl pyrophosphate (GPP) which 
is then elongated to C15 farnesyl pyrophosphate (FPP). 

FPP synthesis is common in both carotenogenic and non- 
20 carotenogenic bacteria. E. coli do not normally contain the genes 

necessary for conversion of FPP to p-carotene (Figure 3). Because of 
this, the Applicant used an E. coli strain containing a reporter plasmid, 
pPCB15, which has the additional genes necessary for p-carotene 
production (Figure 5). Enzymes in the subsequent carotenoid pathway 
25 used to generate carotenoid pigments from FPP precursor can be divided 
into two categories: carotene backbone synthesis enzymes and 
subsequent modification enzymes. The backbone synthesis enzymes 
include geranyl geranyl pyrophosphate synthase (CrtE), phytoene 
synthase (CrtB), phytoene dehydrogenase (Crtl) and lycopene cyclase 
30 (CrtY/L), etc. The modification enzymes include ketolases, hydroxylases, 
dehydratases, glycosylases, etc. 

The genetics of carotenoid pigment biosynthesis are well known 
(Armstrong et al., J. Bact. 176: 4795-4802 (1994); Annu. Rev. Microbiol., 
51 :629-659 (1997)). This pathway is extremely well studied in the Gram- 
35 negative, pigmented bacteria of the genera Pantoea, formerly known as 
Erwinia. In both E. herbicola EHO-10 (ATCC 39368) and E. uredovora 
20D3 (ATCC 19321), the c/t genes are clustered in two operons, crt Z and 
crtEXYIB (US 5,656,472; US 5,5545,816; US 5,530,189; US 5,530,188; 



and US 5,429,939). Despite the similarity in operon structure, the DNA 
sequences of E. uredovora and E. herbicola crt genes show no homology 
by DNA-DNA hybridization (US 5,429,939,). 

The E. coli dxs and idi genes encode for enzymes considered to be 
5 rate-controlling steps in the isoprenoid biosynthetic pathway (Figure 3). 
The chromosomal copies of these genes have very low expression levels 
in the wild-type E. coli. The utility of the present invention is illustrated by 
integrating the phage 75 strong promoter (P75) in front of the dxs or idi 
genes resulting in increased the flux of the isoprenoid pathway. The result 

10 of the increased flux was detected as a measurable increase in p- 
carotene production (Figure 7). 
Recombination Elements 

As used in the present invention, "recombination elements" are 
linear double-stranded DNA fragments or "cassettes" created using PCR. 

15 Typically the recombination elements will range from about 25 bases to 
about 4000 bases. The present method uses two recombination elements 
for in vivo bacterial chromosomal engineering. The linear elements 
contain functional DNA (regulatory elements, promoters, coding 
sequences, genes, etc.), optionally flanked by site-specific recombination 

20 sequences if the functional DNA is to be later removed as in the case of a 
selection marker. The 5' and 3' ends of the recombination elements are 
comprised of recombination regions (homology arms). The homology 
arms, generally about 10 to 50 base pairs in length, are chosen so have 
homology with either a specific sequence on the bacterial chromosome or 

25 a specific sequence on another recombination element. 

The present invention illustrates the use of two recombination 
elements. As illustrated in Figure 1, the 5' end of the "first" recombination 
element contains a recombination region ("RR1") having sequence 
homology to a targeted sequence on the bacterial chromosome. The 3' 

30 end of the first element also contains a recombination region ("RR2") 
having sequence homology to the 5' end of the "second" recombination 
element. The 3' end of the second recombination element ("RR3") shares 
homology with a targeted sequence on the bacterial chromosome. During 
A,-Red mediated homologous recombination, the two elements are 

35 combined and inserted into a specific site on the bacterial chromosome 
based on the sequences of the homology arms. The actual sequence of 
the triple homologous recombination events may vary. 
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The preferred length of the homology arms is about 10 to about 50 
base pairs in length. Given the relatively short lengths of the homology 
arms used in the present invention for homologous recombination, one 
would expect that the level of acceptable mismatched sequences should 
5 be kept to an absolute minimum for efficient recombination, preferably 
using sequences which are identical to those targeted for homologous 
recombination. From 20 to 40 base pairs of homology, the efficiency of 
homologous recombination increases by four orders of magnitude (Yu et 
al. PNAS. 97:5978-5983. (2000)). Therefore, multiple mismatching within 
10 homology arms may decrease the efficiency of homologous 

recombination. A large size (up to about 40 K base pairs) of the 
recombination elements can be PCR generated accurately by Taq DNA 
polymerase such as MasterAmp™ Extra-long PCR Enzyme (Epicentre, 
Inc. Madison, Wl). 

15 The present invention illustrates the use of a selectable marker 

("SM" in Figure 1) as the expressible DNA on one of the two 
recombination elements. The markers are flanked by site-specific 
recombinase recognition sequences (recognition target sequences, 
denoted as "RS" in Figure 1). The recombination element containing the 

20 marker is usually not incorporated into the bacterial chromosome unless it 
is inserted simultaneously and in the correct orientation with the "second" 
recombination element. Expression of the selectable marker in the 
genetically modified bacteria is an indication that a triple homologous 
recombination event has probably occurred. After selection and construct 

25 verification, a site-specific recombinase is used to remove the marker. 

The steps of the present invention can then be repeated with additional in 
vivo chromosomal modifications. The use of selectable markers is know 
in the art and can include, but is not limited to, antibiotic resistance 
markers such as ampicillin, kanamycin, and tetracycline resistance. 

30 Selectable markers may also include amino acid biosynthesis enzymes 
(for selection of auxotrophs normally requiring the exogenously supplied 
amino acid of interest) and enzymes which catalyze visible changes in 
appearance such as p-galactosidase (catalyzes the conversion of Xgal 
into an easily discernable blue pigment) in lac bacteria. 

35 Recombination proficient hosts 

Recombination proficient hosts are hosts that contain a functional 
recombination system and are capable of efficiently being transformed by 
homologous recombination. In a preferred embodiment, the 
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transformation via homologous recombination is targeted to the bacterial 
chromosome. In another preferred embodiment, the DNA used during the 
transformation process is PCR-generated, double-stranded DNA. E. coli, 
a host generally considered as one which does not undergo efficient 
5 transformation via homologous recombination naturally, is altered to make 
it a recombination proficient host. Transformation with a helper plasmid 
containing the A,-Red recombinase system increases the rate of 
homologous recombination several orders of magnitude. The X-Red 
system can also be chromosomally integrated into the host. The X-Red 
10 system contains three genes (exo, bet, and gam) which change the 
normally recombination deficient E. coli into a recombination proficient 
host. 

Normally, E. coli efficiently degrades linear dsDNA via its RecBCD 
endonuclease, resulting in transformation efficiencies not useful for 

15 chromosomal engineering. Gam encodes for a protein that binds to the 
E. coifs RecBCD complex and blocks the complex's endonuclease 
activity. The exo gene encodes for a ^-exonuclease that processively 
degrades the 5' end strand of double-stranded (ds) DNA and creates 3' 
single stranded overhangs. The protein encoded by the bet gene 

20 complexes with the A,-exonuclease and binds to the single-stranded DNA 
overhangs and promotes renaturation of complementary strands and is 
capable of mediating exchange reactions. The A,-Red recombinase 
system allows one to use homologous recombination as a tool for in vivo 
chromosomal engineering in hosts, such as E. coli, normally considered 

25 difficult to transform by homologous recombination. The X-Red system 
works in other bacteria as well (Poteete, A., supra, 2001). E. coli has 
been the primary focus of the A,-Red system because it is a commonly 
used industrial production host. There is no evidence that the A,-Red 
system should not be applicable to other hosts generally used for 

30 industrial production. These additional hosts include, but are not limited to 
Salmonella, Methylomonas, Pseudomonas, Bacillus, and Acinetobacter. 
Transformation and Selection 

The simple cloning bypass method described by the present 
invention allows one to conduct in vivo chromosomal engineering in a 

35 bacterial host. The method allows for repetitive targeted chromosomal 
modifications resulting in the ability to engineer biosynthetic pathways. 
The transformation process can be divided into several distinct steps. 
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First, recombination elements are PCR generated as previously 
described. A pair of dsDNA recombination elements is electroporated into 
a recombination proficient bacterial host. In the present invention, the 
host is E. coli MC1061 harboring a helper plasmid, pKD46, which contains 
5 the A,-Red recombinase system under the control of an arabinose 

inducible promoter. After electroporation, the host cells are cultured in the 
presence of arabinose. The X-Red genes are expressed and triple 
homologous recombination occurs between the two recombination 
elements and the bacterial chromosome. One of the two chromosomally 

10 integrated recombination elements contains a selectable marker 
(kanamycin resistance; denoted as kan R or /can). 

After a period of time, the host cells are selected on a culture media 
containing a compound (i.e. kanamycin in the present invention) which 
selects for those cells that have undergone triple homologous 

15 recombination. In the present invention, a second resistance marker is 
use to confirm the presence of the pPCB15 reporter plasmid which 
contains a chloramphenicol resistance (Cam R ) gene. Colonies that grow 
in the presence of the selection medium are usually sequenced or 
analyzed by PCR fragment analysis to confirm the constructs. 

20 After the confirmation of the constructs has occurred, a second 

helper plasmid is transfected into the host cell. The second "helper" 
plasmid contains a site-specific recombinase (Flp) which recognizes the 
site-specific recombinase target sequences (FRTs) flanking the 
chromosomally integrated selectable marker. The components expressed 

25 on both of the helper plasmids in the present invention could be 

chromosomally integrated into the host cell under the control of conditional 
promoters. The site-specific recombinase, Flp (flippase) in the present 
invention, is expressed and the selectable marker is excised. The 
colonies are cultured in the presence/absence of the selection compound 

30 to identify colonies that have lost the selection marker. The constructs of 
these colonies are confirmed by sequence analysis or by PCR fragment 
analysis or a combination of both. The second helper plasmid is cured 
from the host using a temperature-sensitive replication origin. The 
process described above can be repeated, allowing for targeted microbial 

35 pathway engineering and in the creation of industrially useful production 
hosts. 
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Description of the Preferred Embodiments 

An easy one-step method of bacterial in vivo chromosomal 
engineering has been developed using two linear double-stranded (ds), 
PCR-generated, DNA fragments. The fragments were designed to 
5 contain short flanking regions of homology between the fragments and the 
target site on the host (E. coli) chromosome. 

The phage A,-Red recombinase system expressed on the helper 
plasmid pKD46 (GenBank® Accession number AY048746; SEQ ID 
NO:35; Figure 4) and under control of an arabinose-inducible promoter 

10 was used, resulting in controllable and efficient in vivo triple homologous 
recombination. At least one of the two PCR-generated linear dsDNA 
fragments used during recombination was designed to contain a selective 
marker (kanamycin) flanked by site-specific recombinase sequences 
(FRT) (Example 1). The marker allowed for identification and selection of 

15 the cells that had undergone the desired recombination event. Once the 
constructs of the selected recombinants were verified by sequence 
analysis or PCR fragment analysis, the selective marker was excised by a 
second helper plasmid containing the site-specific recombinase gene 
under the control of the P R promoter of X phage (Examples 3 and 4). 

20 In one embodiment, the method was used for bacterial promoter 

engineering. A strong promoter (phage P T5 ) was placed either upstream 
of the E. coli target gene dxs (Example 2) or idi (Example 6) via triple 
homologous recombination between two PCR-generated linear dsDNA 
fragments and the targeted chromosomal DNA (Figures 2 and 8). In each 

25 example, one of the two fragments contained a kanamycin resistance 
marker flanked by site-specific FRT recombinase sequences. Flanking 
the site-specific recombinase sequences were homology arms that 
contained short (approximately 10-50 bp) regions of homology. As 
illustrated in Figure 2, a first recombination region ("hi") was linked to the 

30 5'-end of the first fragment. A second recombination region ("h2") was 
linked to the 3'-end of the first fragment. The second PCR generated 
linear dsDNA fragment contained the P75 strong promoter. The third 
recombination region ("h3") was linked to the 3'-end of the second 
fragment. The first recombination region ("hi") had homology to an 

35 upstream portion of the native bacterial chromosomal promoter targeted 
for replacement. The second recombination region ("h2") located on the 
3'-end of the first fragment had homology to the 5'-end portion of the 
second fragment. The third recombination region ("h3") had homology to 



a downstream portion of the native bacterial chromosomal promoter 
targeted for replacement (Figure 2). 

The E. coli host, containing the A,-Red recombination system on the 
helper plasmid pKD46 (Figure 4), was transformed with the two PCR- 
5 generated fragments resulting in the chromosomal replacement of the 
targeted native promoter with the construct containing the kanamycin 
selectable marker of first fragment and the P T5 strong promoter on the 
second fragment (Examples 4 and 7, Figures 2 and 8). The promoter 
replacement resulted in the formation of an augmented E. coli 

10 chromosomal gene (either dxs or idi gene), operably linked to the 
introduced non-native promoter. The bacterial host cells that had 
undergone the desired recombination event were selected according to 
the expression of the selectable marker and their ability to grow in 
selected media. The selected recombinants were then transformed with a 

15 second helper plasmid, pCP20 (ATCC PTA-4455; Datsenko and Wanner, 
supra), expressing the flippase (Flp) site-specific recombinase which 
excised the selectable marker (Figure 2 and 8, Examples 3 and 6). The 
constructs were confirmed via PCR fragment analysis (Figures 6 and 9, 
Examples 4 and 7). The recombinant bacterial host cell containing the 

20 augmented gene (either dxs or idi) and the carotenoid reporter plasmid 
(pPCB15; SEQ ID NO:37) was then tested for increased production of p- 
carotene. Placement of either the E. coli dxs or idi genes, normally 
expressed at very low levels, under control of the P7-5 strong promoter 
resulted in significant increases in p-carotene production (Example 5, 

25 Figure 7). 

In another embodiment, a reporter strain of E. coli was constructed 
for assaying p-carotene production. Briefly, the E. coli reporter strain was 
created by cloning the gene cluster crtEXYIB from Pantoea stewartii into a 
helper plasmid (pPCB15) which was subsequently used to transform the 

30 E. coli host strain (Figure 5). Identification and selection for host cells 

containing the reporter plasmid was accomplished by culturing the cells in 
chloramphenicol (pPCB15 contains the chloramphenicol resistance gene, 
Cam R ). The cluster contains many of the genes required for the synthesis 
of carotenoids, with p-carotene normally produced in the transformed 

35 E. coli (Figure 7). It should be noted that the crtZ gene (p-carotene 
hydroxylase) was included in the gene cluster. However, since no 
promoter was present to express the crtZ gene (organized in opposite 
direction and adjacent to crtB gene) no zeaxanthin was produced. Thus, 

35 



the zeaxanthin glucosyl transferase enzyme (encoded by the crtX gene 
located within the gene cluster) had no substrate for its reaction. 
Increases in p-carotene production were reported as increases relative to 
the control strain production (Figure 7). 
5 The general method can be used for a variety of chromosomal 

modifications based on the genetic information contained on the two PCR 
generated fragments. In another embodiment, the method was used to 
simultaneously add a foreign gene and promoter. The first of the two 
PCR-generated fragments was designed so that is contains the fusion 

10 product of a selectable marker (kanamycin) and promoter (P75) (Example 
8, Figure 10 and 11)). The second PCR-generated fragment contained a 
foreign gene, the Methylomonas 16a strain dxs gene denoted as 
"dxs(16a)" . Once again, homology arms were designed to allow for 
precise incorporation into the host bacterial chromosome. The gene of 

15 interested can be simply added or used to replace a targeted native gene 
depending upon the selected homology arms used. The desired 
recombinants were selected by methods previously described. The 
selectable marker was then removed by a site-specific recombinase as 
previously described (Figure 11, Example 9). The recombinant constructs 

20 were confirmed by PCR fragment analysis (Figure 12, ExamplelO). The 
effect on p-carotene production in the transformed E coli reporter strain 
was measured as previously described. Cells containing the 
Methylomonas 16a strain dxs gene (homologous to the E. coli dxs gene) 
under the control of the P T5 strong promoter exhibited significant 

25 increases in p-carotene production (Figure 7). The present one-step 

method using homologous recombination was useful in the simultaneous 
addition of a foreign promoter and gene. Subsequent removal of the 
selectable marker is required so that the process can be repeated, if 
desired, to engineer bacterial biosynthetic pathways so that the final result 

30 is increased production of the desired product. 

In another embodiment, the applicability of the present method was 
illustrated by the creation of an E. coli operon. The prokaryotic operon 
created was comprised of three Pantoea stewartii carotenoid biosynthesis 
genes (c/fE, crtl, and crtB genes) operably linked to a single strong 

35 promoter (P75). The first PCR-generated linear dsDNA fragment was 

comprised of a fusion product between a kanamycin selectable marker, a 
P75 strong promoter, and a gene of interest (Pantoea stewartii crtE) 
operably linked to the P T5 promoter (kan-P T5 -crtE, Figure 13). The 



kanamycin selectable marker was flanked by site-specific recombinase 
sites for later removal as previously described. The first fragment was 
flanked by appropriate homology arms on the 5' and 3-ends. A second 
PCR-generated linear dsDNA fragment was constructed to contain the 
5 foreign P. stewartii crtIB genes and contained an appropriate 3'-end 

homology arm (Figure 13, Example 12). Triple homologous recombination 
occurred as previously described resulting in the chromosomal integration 
and formation of an operon containing the crtE gene of first fragment 
closely associated with the crtIB genes of the second fragment, operably 

10 linking all of the genes to the P T5 promoter on the first fragment. Once 
again, the marker was removed after selection in an appropriate media as 
previously described. The constructs were confirmed by PGR fragment 
analysis (Figure 14, Example 13). The transformed E. coli strain 
containing the functionally expressed crtEIB operon did not contain the 

15 pPCB15 reporter plasmid as in previous examples. It should be noted 
that the pPCB15 reporter plasmid also contained a functionally expressed 
crtY gene (lycopene cyclase) which was responsible for converting 
lycopene (pink color) into p-carotene (orange color). Expression of the 
crtEIB operon in the transformed E. coli strain lacking the pPCB15 helper 

20 plasmid was confirmed by the accumulation of pink colonies (indicative of 
lycopene accumulation) (Example 13). 

In another embodiment, a bacterial host strain can be engineered 
to contain multiple chromosomal modifications, including multiple 
promoter and gene additions or replacements so that the production 

25 efficiency of the desired final product is increased. In a preferred 

embodiment, the incorporated or augmented chromosomal genes encode 
for enzymes useful for the production of carotenoids. In a further 
preferred embodiment, multiple copies of genes may be chromosomally 
engineered into the host genome. 

30 The present one-step method can be used for a variety of targeted 

in vivo bacterial chromosomal modifications. The genetic information 
contained within the two PCR-generated dsDNA fragments (and their 
associated homology arms) determines the type and location of the 
desired chromosomal modification. The elimination of a cloning step used 

35 in previous methods of transformation eliminates a lot of laboratory time 
and its associated expenses. The removal of the selectable marker using 
a site-specific recombinase allows for one to conduct multiple 
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chromosomal modifications, necessary for engineering biosynthetic 
pathways and for optimizing production of industrially useful materials. 

EXAMPLES 

The present invention is further defined in the following Examples. 
5 It should be understood that these Examples, while indicating preferred 
embodiments of the invention, are given by way of illustration only. From 
the above discussion and these Examples, one skilled in the art can 
ascertain the essential characteristics of this invention, and without 
departing from the spirit and scope thereof, can make various changes 
10 and modifications of the invention to adapt it to various usages and 
conditions. 

GENERAL METHODS 

Standard recombinant DNA and molecular cloning techniques used 
in the Examples are well known in the art and are described by Sambrook, 

15 J., Fritsch, E. F. and Maniatis, T. Molecular Cloning: A Laboratory 
Manual] Cold Spring Harbor Laboratory Press: Cold Spring Harbor, 
(1989) (Maniatis) and by T. J. Silhavy, M. L. Bennan, and L. W. Enquist, 
Experiments with Gene Fusions, Cold Spring Harbor Laboratory, Cold 
Spring Harbor, NY (1984) and by Ausubel, F. M. et al., Current Protocols 

20 in Molecular Biology, pub. by Greene Publishing Assoc. and Wiley- 
Interscience (1987). 

Materials and methods suitable for the maintenance and growth of 
bacterial cultures are well known in the art. Techniques suitable for use in 
the following examples may be found as set out in Manual of Methods for 

25 General Bacteriology (Phillipp Gerhardt, R. G. E. Murray, Ralph N. 

Costilow, Eugene W. Nester, Willis A. Wood, Noel R. Krieg and G. Briggs 
Phillips, eds), American Society for Microbiology, Washington, DC. (1994)) 
or by Thomas D. Brock in Biotechnology: A Textbook of Industrial 
Microbiology , Second Edition, Sinauer Associates, Inc., Sunderland, MA 

30 (1989). All reagents, restriction enzymes and materials used for the 
growth and maintenance of bacterial cells were obtained from Aldrich 
Chemicals (Milwaukee, Wl), DIFCO Laboratories (Detroit, Ml), 
GIBCO/BRL (Gaithersburg, MD), or Sigma Chemical Company (St. Louis, 
MO) unless otherwise specified. 

35 Manipulations of genetic sequences were accomplished using the 

suite of programs available from the Genetics Computer Group Inc. 
(Wisconsin Package Version 9.0, Genetics Computer Group (GCG), 
Madison, Wl). Where the GCG program "Pileup" was used the gap 



creation default value of 12, and the gap extension default value of 4 were 
used. Where the CGC "Gap" or "Bestfit" programs were used the default 
gap creation penalty of 50 and the default gap extension penalty of 3 were 
used. Multiple alignments were created using the FASTA program 
5 incorporating the Smith-Waterman algorithm (W. R. Pearson, Comput. 
Methods Genome Res., [Proc. Int. Symp.] (1994), Meeting Date 1992, 
111-20. Editor(s): Suhai, Sandor. Publisher: Plenum, New York, NY). In 
any case where program parameters were not prompted for, in these or 
any other programs, default values were used. 

10 The meaning of abbreviations is as follows: "h" means hour(s), 

"min" means minute(s), "sec" means second(s), "d" means day(s), "ml_" 
means milliliters), "jaL" means microliter(s), "L" means liter(s), and "rpm" 
means revolutions per minute. 
A,-Red Recombinase plasmid system 

15 The plasmids (pKD4, pKD46 and pCP20) used in the present 

invention have been previously described in the literature (Datsenko and 
Wanner, supra). The sequences for the A,-Red recombinase system 
components are contained on helper plasmid pDK46 (SEQ ID NO:35; 
GenBank® Accession No. AY048746). The Flp/FRT site-specific 

20 recombinase helper plasmid used in the present invention was pCP20 
(ATCC Number PTA-4455) Plasmid pKD4 (SEQ ID NO:34; GenBank® 
Accession No. AY048743) was used as a template molecule for PCR 
amplification of the FRT flanked kanamycin resistance marker). 

EXAMPLE 1 

25 Synthesis of Two Linear PCR-qenerated DNA Fragments Used for 
Engineering the Promoter of dxs Gene on the E. coli Chromosome 
The linear DNA fragment (1489 bp) containing a kanamycin 
selectable marker flanked by site-specific recombinase target sequences 
(FRT) was synthesized by PCR from plasmid pKD4 (Datsenko and 

30 Wanner, supra) with primer pairs, 
T1(dxs) (5'- 

TGGAAGCGCTAGCGGACTACATCATCCAGCGTAATAAATAA CGTCTT 
GAGCGATTGTGTAG-3 , ) (SEQ ID NO: 1) which contains a hi homology 
arm (underlined, 41 bp) chosen to match sequences in the upstream 
35 region of the ispA stop codon and a priming sequence (20 bp) and B1(T5) 
(5'- CTCGAGGTGAAGACGAAAGGGCCTCGTGATACGCCTATTTTTATA 
GGTTA CATATGAATATCCT CCTTAG -3') (SEQ ID NO:2) that contains a 
h2 homology arm (underlined, 50 bp) chosen to match sequences in the 
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5'-end region of the promoter DNA fragment and a priming sequence 
(20 bp) (Figure 2). A second linear DNA fragment (154 bp) containing a 
phage promoter (P7-5) comprised of the -10 and -35 consensus 
sequences, lac operator (/acO), and ribosomal binding site (rbs) was 
5 synthesized by PCR from plasmid pQE30 (Qiagen, Valencia, CA) with 
primer pairs, T2(T5) (5'-TAACCTATAAAAA TAGGCGTATCACGAGG 
CCC-3') (SEQ ID NO:3) that contains a priming sequence (32 bp) and 
B2(dxs) (S'- GGAGTCGACCAG TGCCAGGGTCGGGTATTTGG 
CAATATCAAAAC TCATA GTTAATTTCTCCTCTTTAATG-3') (SEQ ID 

10 NO:4) that contains a h3 homology arm (underlined, 48bp) chosen to 
match sequences in the downstream region of the dxs start codon and a 
priming sequence (22 bp) (Figure 2). The underlined sequences illustrate 
each respective homology arm, while the remainder is the priming 
sequences for hybridization to complementary nucleotide sequences on 

15 the template DNA for the PCR reaction. The two resultant PCR fragments 
were the kanamycin selectable marker containing the homology arms (hi 
and h2) and the Pj 5 promoter containing the homology arm (h3) as 
illustrated in Figure 2. Standard PCR conditions were used to amplify the 
linear DNA fragments with AmpliTag Gold® polymerase (Applied 

20 Biosystems, Foster City, CA) as follows; 



PCR reaction: PCR reaction mixture: 

Stepl 94 °C 3 min 0.5 jaL plasmid DNA 

Step2 93 °C 30 sec 5 1 0X PCR buffer 

25 Step3 55 °C 1 min 1 jaL dNTP mixture (10 mM) 

Step4 72 °C 3 min 1 jaL 5'-primer (20 jiM) 

Step5 Go To Step2, 30 cycles 1 jaL 3'-primer (20 jaM) 

Step6 72 °C 5 min 0.5 Taq DNA polymerase 

41 ^iL sterilized dH 2 0 

30 

After completing the PCR reactions, PCR products were purified 
using the QIAquick Gel Extraction Kit™ (Cat. # 28704, QIAGEN Inc. 
Valencia, CA). Briefly, 50 jaL of each PCR reaction mixture was run on a 
1 % agarose gel. The band containing the amplified DNA fragment was 
35 excised from the agarose gel with a clean, sharp scalpel. The gel slice 
was weighted in a colorless plastic tube. After adding 3 volumes of buffer 
QG to 1 volume of gel, the tube was incubated at 50 °C for 10 min (or until 
the gel slice has completely dissolved). After the gel slice has dissolved 
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completely, one gel volume of isopropanol was added to the sample and 
mixed by inverting the tube several times. In order to bind DNA, the 
sample was applied to the QIAquick gel extraction column and centrifuged 
for 1 min. 750 ^iL of buffer PE was applied to the column for washing. 
5 After centrifuging for 1 min to remove the wash buffer, PCR DNA products 
were eluted with 10 \iL of distilled water (dH 2 0) by standing sample for 1 
min, and then by centrifuging for 1 min. DNA Clean & Concentrator™ 
(Zymo Research, Orange, CA) was used to further purify the PCR DNA 
samples. After adding 2 volumes of DNA Binding Buffer to each volume 

10 of DNA sample, the samples were loaded into a Zymo-Spin Column 

(Zymo Research) and centrifuged at full speed (>1 0,000 g) for 5-10 sec. 
The PCR DNA sample retained in the column was washed twice with - 
200 |aL of Wash Buffer by centrifuging at top speed for 5-10 seconds. The 
DNA was eluted with 6-8 |aL of distilled water by spinning at top speed two 

15 times. The concentration of PCR DNA sample was about 0.5-1.0 jug/jaL. 

EXAMPLE 2 

Electro-transformation of Two Linear PCR-generated DNA Fragments into 
an E. coli Strain Expressing A,-Red Recombinase 
The E. coli MC1061 strain, carrying both a A,-Red recombinase 

20 expression plasmid (Figure 4) and a reporter plasmid (Figure 5), was used 
as a host strain for the recombination of PCR fragments. The strain was 
constructed by the co-transformation with a X-Red recombinase 
expression plasmid, pKD46 (amp R ) (Datsenko and Wanner, supra) 
(Figure 4) and a p-carotene biosynthesis expression plasmid, pPCB15 

25 (cam R ) (Figure 5) into the E. coli strain MC1061. The plasmid pPCB15 
contains the carotenoid gene cluster (crtEXYIB) from Pantoea Stewartii 
(ATCC 8199). The X-Red recombinase in pKD46 is comprised of three 
genes exo, bet, and gam expressed under the control of an arabinose- 
inducible promoter. Transformants were selected on 100 jig/mL ampicillin 

30 and 25 |ag/mL chloramphenicol LB plates at 30 °C. 

The electro-competent cells of E. coli MC1061 strain carrying 
pKD46 and pPCB15 were prepared as follows. E. coli MC1061 cells 
carrying pKD46 and pPCB15 were grown in SOB medium with 100 ng/mL 
ampicillin, 25 |ig/ml chloramphenicol, and 1 mM L-arabinose at 30 °C to 

35 an OD 60 o of 0.5, following by chilling on ice for 20 min. Bacterial cells 
were centrifuged at 4,500 rpm using a Sorvall® RT7 PLUS (Kendro 
Laboratory Products, Newton, CT) for 10 min at 4 °C. After decanting the 
supernatant, the pellet was resuspended in ice-cold water and centrifuged 
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again. This was repeated twice and the cell pellet was resuspended in 
1/100 volume of ice-cold 10 % glycerol. 

Both the kanamycin marker PCR products (1-5 jag) and phage 
Pj-spromoter PCR products (1-5 \ig) were mixed with 50 \iL of the 
5 competent cells and pipetted into a pre-cooled electroporation cuvette (0.1 
cm) on ice. Electroporation was performed by using a Bio-Rad Gene 
Pulser set at 1 .8 kV, 25 jaF with the pulse controller set at 200 ohms. 
SOC medium (1 ml_) was added after electroporation. The cells were 
incubated at 37 °C for 1 hour. Approximately one-half of cells were spread 
10 on LB plates containing both 25 |ng/mL kanamycin and 25 M,g/mL 
chloramphenicol in order to select double antibiotic-resistant 
transformants. After incubating the plate at 37 °C overnight, one double 
antibiotic-resistant transformant was selected. 

EXAMPLE 3 

15 Elimination of the Antibiotic Selectable Marker using a FLP Recombinase 

Expression Plasmid 
The Flp recombinase expression plasmid pCP20 (amp R ), which 
has a temperature-sensitive replication origin, was transiently transformed 
into the kanamycin-resistant transformant by electroporation. Bio-Rad 

20 Gene Pulser (Bio-Rad Laboratories, Hercules, CA) set at 1.8 kV, 25 ixF 
with the pulse controller set at 200 ohms was used for electro- 
transformation. Cells were spread on 100 jag/mL ampicillin and 25 jug/mL 
chloramphenicol LB plates, and grown at 30 °C for 1 day. Colonies were 
picked and streaked on 25 jig/mL chloramphenicol LB plates without 

25 ampicillin antibiotics and incubated at 43 °C overnight (Plasmid pCP20 
has a temperature sensitive origin of replication and was cured from the 
host cells by culturing cells at 43°C). The colonies were tested for 
ampicillin and kanamycin resistance to test loss of pCP20 and kanamycin 
selectable marker by streaking colonies on 100 |ug/mL ampicillin LB plate 

30 or 25 |^g/mL kanamycin LB plate. The kanamycin selectable marker was 
removed by Flp recombination between the FRT sites by transient 
transformation of a Flp expression plasmid based on the temperature- 
sensitive origin. 

EXAMPLE 4 

35 Confirmation of Chromosomal Integration of the Phage P7- 5 Promoter in 

Front of the dxs Gene by PCR Analysis 
The chromosomal integration of both the kanamycin selectable 
marker and phage P T5 promoter in the front of dxs gene on E. coli 



chromosome was conformed by PCR analysis. A colony of transformants 
was resuspended in 50 jaL of PCR reaction mixture containing 200 |uM 
dNTPs, 2.5 U Ampli Taq Gold™ (Applied Biosytems), 0.4 jaM of different 
primer pairs, T1 (5'- CGCCAGTCGCTGAAACAACTGGCTGA-3')(SEQ ID 
5 NO:5) and T2 (5'-CAGTCATAGC CGAATAGCCT-3')(SEQ ID NO:6), T3 
(5'-CGGTGCCCTGAATGAACTGC-3 , )(SEQ ID NO:7) and T4 (5'- 
TGGCAACA GTCGTAGCTCCTGGGTGG-3') (SEQ ID NO:8), T2(T5) and 
T4, and T1 and T4. Test primers were chosen to amplify regions located 
either in the vicinity of the integration region or in the kanamycin and 

10 phage P T5 promoter region (Figure 6). PCR test with T1 and T2, T3 and 
T4, T2(T5) and T4, and T1 and T4 primer pairs revealed the expected 
sizes, 682 bp, 1 1 16 bp, 229 bp and 1887 bp on 1 % agarose gel, 
respectively (Figure 6). PCR analysis was repeated after elimination of 
the kanamycin selectable marker, and showed no PCR products for T1 

15 and T2, and T3 and T4, and the expected sizes 229 bp and 494 bp for 
T2(T5)/T4 and T1/T4, respectively (Figure 6). The PCR results indicated 
the correct integration of the P T5 promoter fragment in the front of 
chromosomal dxs gene, yielding E. coli P T5 -dxs. 

EXAMPLE 5 

20 Measurement of (3-Carotene Production in E. coli P T 5 -dxs 

The chromosomal integration of the Pj 5 promoter in the front of dxs 
gene was further characterized by measuring the p-carotene production of 
E. coli Pj 5 -dxs containing the reporter plasmid, pPCB15, along with E. coli 
control strain also containing pPCB15. E. coli P T5 -dxs and E. coli control 

25 strains were grown in 5 ml_ LB containing 25 jug/mL chloramphenicol at 
37°C for 24 h, and then harvested by centrifugation at 4000 rpm for 10 
min. The p-carotene pigment was extracted by resuspending the cell 
pellet in 1 mL of acetone with vortexing for 1 min and then shaking the 
sample for 1 h at room temperature by using Rocking Platform 200 (VWR 

30 International, West Chester, PA). After centrifuging the sample at 4000 
rpm for 10 min, the absorption spectrum of the acetone layer containing p- 
carotene was measured at X 450 nm by using Ultrospec 3000 
spectrophotometer (Amersham Biosciences, Piscataway, NJ). The 
production of p-carotene in E. coli P T5 -dxs was approximately 3-fold 

35 higher than the control strain (Figure 7). The results indicate that the 
increased production of p-carotene in E. coli P T5 -dxs was due to the 
increased expression of the dxs gene under control of the strong P T5 
promoter. The E. coli P T5 -dxs construct was shown to be correct by 
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testing its function from the measurement of p-carotene in E. coli P T5 -dxs 
versus control strain as well as by PCR analysis. 

EXAMPLE 6 

Engineering the Promoter of the idi Gene on the E. coli Chromosome - 
5 Preparation of Two Linear (PCR-generated) DNA Fragments, Electro- 
transformation, and Elimination of the Antibiotic Selectable Marker 
The general applicability of this invention to the chromosomal 
promoter replacement was further examined by engineering the promoter 
of idi gene on E. coli chromosome. The endogenous promoter, operably 
linked to idi gene located at 65.3 min of the E. coli chromosome, was 
replaced with a P T5 strong promoter by using two PCR-generated DNA 
fragments. The DNA fragments were synthesized by PCR as described in 
Example 1, except using different primers. The linear DNA fragment 
(1489 bp) containing a kanamycin selectable marker flanked by FRT 
sequences was synthesized by PCR from plasmid pKD4 (Datsenko and 
Wanner, supra) with primer pairs, T(idi) (5'- 

TCTGATGCGCAAGCTGAAGAAAAATGAGCATGGAGAATAATATGA CG 
TCTTGAGCGATTGTGTAG-3 , )(SEQ ID NO:9) which contains a h4 
homology arm (underlined, 45 bp) chosen to match a sequence in the 
upstream region of the ygfU stop codon and a priming sequence (20 bp) 
and B1(T5) that contains a h2 homology arm (underlined, 50 bp) chosen 
to match sequences in the 5'-end region of the promoter DNA fragment 
and a priming sequence (20 bp) (Figure 8). A second linear DNA 
fragment (154 bp) containing a P T5 promoter was synthesized by PCR 
from pQE30 (QIAGEN, Inc.) with primer pairs, T2(T5) (5'- 
TAACCTATAAAAA TAGGCGTATCACGAGG CCC-3') (SEQ ID NO:3) 
that contains a priming sequence (32 bp) and B(idi) (5'- 
TG G G AACTC CCTGTG C ATTC AATAAAATG AC GTGTTC C GTTTG 
CATAGTTAATTTCTCCTCTTTAATG-3')(SEQ ID NO: 10) which contains a 
30 h5 homology arm (underlined, 46bp) chosen to match sequences in the 
downstream region of the idi start codon and a priming sequence (22bp) 
(Figure 8). The underlined sequences illustrate each respective homology 
arm, while the remainder is the priming sequences for hybridization to 
complementary nucleotide sequences on the template DNA for the PCR 
35 reaction. The two resultant PCR fragments were the kanamycin selectable 
marker containing the homology arms (h2 and h4) and the Pj^promoter 
containing the homology arm (h5) as illustrated in Figure 8. 
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The PCR reaction, purification, and electro-transformation were 
performed as described in Example 1 and 2. The purified kanamycin 
cassette PCR products (1-5 |ng) and P T5 promoter PCR products (1-5 p,g) 
were co-transformed into E. coli expressing X-Red recombinase. Cells 
5 were plated and screened for antibiotic-resistant transformants as 
described in Example 1. Fourteen kan R and cam R double-resistant 
transformants were selected. The kanamycin selectable marker was 
eliminated according to the method as described in Example 3. 

EXAMPLE 7 

io Confirmation of Chromosomal Integration of the P T r Promoter in Front of 

the idi Gene 

Among fourteen double antibiotic-resistant transformants, five 
transformants were arbitrarily selected and were analyzed by PCR 
analysis with different combination of specific primer pairs, T5 (5'- 

15 TCATGCTGACCTGGTGAAGGAATCC-3')(SEQ ID NO:1 1) and T2, 
T2(T5) and T6 (5 - TCATGCTGACCTGGTGAAGGAATCC-3')(SEQ ID 
NO: 12), and T5 and T6 (Figure 9). The PCR reaction was performed as 
described in Example 4. Test primers were chosen to amplify sequences 
located either in the vicinity of the integration region or in the kanamycin 

20 and the P T5 promoter region. PCR test with T5 and T2, T2(T5) and T6, 
and T5 and T6 primer pairs for all five transformants gave the expected 
sizes, 627 bp, 274 bp and 1877 bp on 1 % agarose gel, respectively. 
After elimination of the kanamycin selectable marker as described in 
Example 3, PCR analysis with T5 and T2 showed no PCR products. PCR 

25 analysis with T2(T5) and T6 and T5 and T6 primer pairs was repeated and 
showed PCR products with the expected sizes or 274 bp and 484 bp, 
respectively. The results indicated the correct integration of the kanamycin 
selectable marker and the P T5 promoter fragment in the front of 
chromosomal /d/'gene, yielding E. coli P T5 -idi. 

30 The chromosomal integration of a P T5 promoter in the front of idi 

gene was further confirmed by quantification of the p-carotene production 
of E. coli Pjt^idi versus control strain as described in Example 5. The 
E. coli Pjs-idi exhibited a 1.5-fold increase in the production of p-carotene 
when compared to the control strain containing the idi gene linked to its 

35 native promoter (Figure 7). 
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EXAMPLE 8 

Construction of a Template Plasmid Carrying a Fused Antibiotic 
Selectable Marker and Promoter 
The plasmid carrying a fused kanamycin selectable marker and P T5 
5 promoter was constructed to facilitate the chromosome engineering. The 
plasmid containing a fused kanamycin selectable marker and the P T5 
promoter was constructed by cloning a Pj 5 strong promoter region into the 
A/del restriction endonuclease site of pKD4 (Datsenko and Wanner, 
supra) (Figure 10). First, the linear DNA fragment containing the P T5 
10 promoter element (154 bp) comprised of the -10 and -35 consensus 
sequences, /acO, and rbs was synthesized by PCR from pQE30 
(QIAGEN, Inc.) with primer pairs, TT5 (5*- 

ATCATGACATTAA CATATG AAAATAGGCGTATCACGAGGCCC-3 ,> ) 
(SEQ ID NO: 13) and BT5 (5'- 

15 GATGCGATCCCATATGAGTTAATTTCTCCTCTTTAATG-3') (SEQ ID 
NO: 14). The underlined sequences in the primers represent the sequence 
of the A/del site. The PCR reaction condition was same as described in 
Example 1. The resulting PCR product was digested with A/del and cloned 
into A/del-digested pKD4, yielding pSUH5 (SEQ ID NO:36; Figure 10). The 

20 plasmid pSUH5 was used as a template to generate the linear PCR DNA 
fragment containing the fused kanamycin selectable marker-phage Pj 5 
promoter. The pSUH5 construction was confirmed by DNA sequencing 
(DuPont CR&D DNA sequence facility, Wilmington, DE). 

EXAMPLE 9 

25 Chromosomal Integration of the Methvlomonas 16A dxs Gene in E. coli- 
Preparation of Two Linear DNA Fragments: (1) Fused Antibiotic 
Selectable Marker-Promoter and (2) Foreign Gene, Electro- 
transformation and Elimination of the Antibiotic Selectable Marker 
The applicability of this invention was further extended to 
chromosomal integration of a foreign gene into E. coli. This example 
describes the chromosomal integration of the Methylomonas 16a dxs 
gene (denoted as "dxs(16a)" in the present invention; SEQ ID NO: 15) into 
the region between genes located at 30.9 min of E. coli chromosome by 
homologous recombination between the E coli chromosome and two 
linear DNA fragments, a fused kanamycin selectable marker-phage P T5 
promoter and the Methylomonas 16a dxs gene. The dxs gene is the 
Methylomonas 16a (ATCC PTA-2402) homologue of the E. coli dxs gene. 
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The scheme for the chromosomal integration of the Methylomonas 16a 
dxs gene by homologous recombination is shown in Figure 11. 

The linear DNA fragment containing fused kanamycin selectable 
marker-phage Pj 5 promoter was synthesized by PCR from pSUH5 with 
5 primer pairs, T1 (dxsl 6a) (5'- 

CACTAACGCCCGCACATTGCTGCGGGCTTTTTGATTCATTTCGCA CG 
TCTTGAGCGATTGTGTAG-3')(SEQ ID NO: 16) which contains a h6 
homology arm (underlined, 45bp) chosen to match a sequence in the 
region between genes located at 30.9 min of E. coli chromosome and a 

10 priming sequence (20bp) and B1(dxs16a) (5'- 

AGTAGAGGGAAGTCTTTGGAAAGAGCCATA GTTAATTTCTCCTCTTTA 
ATG-3')(SEQ ID NO: 17) which contains a h7 homology arm (underlined, 
29bp) chosen to match a sequence in the downstream region of the 
dxs(16a) start codon and a priming sequence (22bp) (Figure 11). The 

15 linear DNA fragment containing the Methylomonas 16a dxs gene was 

synthesized by PCR from Methylomonas 16a (ATCC PTA-2402) genomic 
DNA with primer pairs, T2(dxs16a) (5'- 

ACAGAATTCATTAAAGAGGAGAAATTAACTA TGGCTCTTTCCAAAGAC 
TTCCCTC-3')(SEQ ID NO: 18) which contains a h8 homology arm 
20 (underlined, 30bp) chosen to match a sequence in the 3'-end region of the 
fused kanamycin selectable marker- P T5 promoter and a priming 
sequence (25 bp) and B2(dxs16a) (5'- 

AGGAGCGAAGTGATTATCAGTATGCTGTTCATATAGCCTCGAATTA TC 
AAGCGCAAAACTGTTCGATG-3')(SEQ ID NO: 19) which contains a h9 
homology arm (underlined, 46bp) chosen to match a sequence in the 
region between genes located at 30.9 min of the E. coli chromosome and 
a priming sequence (22 bp) (Figure 11). The underlined sequences 
illustrate each respective homology arm, while the remainder is the 
priming sequences for hybridization to complementary nucleotide 
sequences on the template DNA for the PCR reaction. The two resultant 
PCR fragments were the kanamycin selectable marker containing the 
homology arms (h6 and h7) and the dxs(16a) gene containing the 
homology arms (h8 and h9) as illustrated in Figure 11. 

The PCR reaction, purification and electro-transformation were 
performed as described in Example 1 and 2. Both the fused kanamycin- 
phage P T5 promoter PCR products (1-5 \ig) and the Methylomonas 16a 
dxs PCR products (1-5 jig) were co-transformed into an E. coli host cell 
expressing the X-Red recombinase system. Transformants were selected 
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as previously described and seven kan R and cam R double-resistant 
transformants were selected. The kanamycin selectable marker was 
eliminated according to the method as described in Example 3. 

EXAMPLE 10 

5 Confirmation of Chromosomal Integration of the Fused Antibiotic 

Selectable Marker-Promoter and the Foreign dxs Gene from 

Methylomonas 16A. 
All seven double antibiotic-resistant transformants were PCR- 
analyzed with different combination of specific primer pairs, T7 (5'- 

10 ACCGGATATCACCACTTAT CTGCTC-3')(SEQ ID NO:20) and T2, T7 
and T8 (5'-ATGCTGACCGTGTGGGTCAGATAGC-3')(SEQ ID NO:21), 
T2(T5) and T9 (5'-GCGATATTGTATGTCTGATTCAGGA-3')(SEQ ID 
NO:22), and T7 and T9 (Figure 12). Test primers were chosen to amplify 
sequences located either in the vicinity of the integration region of the 

15 kanamycin selectable marker- P T5 promoter region or the Methylomonas 
16a dxs gene as shown in Figure 12. The PCR reaction was performed as 
described in Example 4. From the PCR analysis with pairs T7 and T2, T7 
and T8, T2(T5) and T9, and T7 and T9, two out of seven transformants 
gave the expected sizes, 643 bp, 1908 bp, 2184 bp and 3803 bp on 1 % 

20 agarose gel, respectively. The results indicated that the two transformants 
had the correct integration of the fused kanamycin selectable marker- P T5 
promoter fragment and the Methylomonas 16a dxs gene into the region 
between genes located at 30.9 min of E. coli chromosome. After 
elimination of the kanamycin selectable marker, PCR analysis with primer 

25 pair T7 and T2 resulted in no PCR product. PCR analysis with primer 

pairs T7 and T8, T2(T5) and T9, and T7 and T9 showed the expected size 
fragments of 515 bp, 2184 bp and 2410 bp on 1 % agarose gel, 
respectively. The resultant E. coli P T5 -dxs(16a) transformation was tested 
for its (3-carotene production as described in Example 5 and exhibited 3.3- 

30 fold higher in the production of p-carotene than control strain (Figure 7). 

EXAMPLE 11 

Chromosomal Integration of the P. stewartii crtE Gene in E. coli- 
Preparation of Two Linear DNA Fragments: (1) Fused Kanamycin 
Selectable Marker- P T 5 Promoter and (2) the P. stewartii crtE Gene. 
35 This example describes the chromosomal integration of P. stewartii 

crtE and crtIB genes into the inter-operon region located at 81 .2 min of 
E. coli chromosome by integration of P. stewartii crtE and P. stewartii crtIB 
(Figure13). The crtE, crtl t and crtB genes encode geranylgeranyl 



pyrophosphate synthase, phytoene dehydrogenase, and phytoene 
synthase, respectively. These genes are part of the carotenoid 
biosynthetic pathway (Figure 3). 

The linear DNA fragment containing fused kanamycin selectable 
5 marker- P T5 promoter is synthesized by PCR from pSUH5 with primer 
pairs, T1(crtE) (5'- 

AGCCGTCGCAGGAGGAACAACTCATATCATCATTGCGATCTCGAC CG 
TCTTGAGCGATTGTGTAG-3')(SEQ ID NO:23) which contains a h10 
homology arm (underlined, 45bp) chosen to match a sequence in the 
10 inter-operon region located at 81.2 min of E. coli chromosome and a 
priming sequence (20bp) and B1(crtE) (5'- 

TGAACGTGTTTTTTTGCGCAGACCGTCATA GTTAATTTCTCCTCTTTA 
ATG-3')(SEQ ID NO:24) which contains an h11 homology arm 
(underlined, 29bp) chosen to match a sequence in the downstream region 
15 of the crtE start codon and a priming sequence (22bp) (Figure 13). The 
linear DNA fragment containing P. stewartii crtE gene was synthesized by 
PCR from pPCB15 (Figure 5) with primer pairs, T2(crtE) (5'- 
AC AG AATTC ATTAAAG AG G AG AAATTAACT ATG AC G GTCTG C G C AAA 
AAAACACG-3')(SEQ ID NO:25) which contains an h8 homology arm 
20 (underlined, 30bp) chosen to match a sequence in the 3'-end region of the 
fused kanamycin selectable marker- P T5 promoter and a priming 
sequence (25 bp) and B2(crtE) (5'- 

AG AATG AC C AG CTG G ATG C ATTATCTTTATTTG G ATC ATTG AG G G TTA 
ACTGACGGCAGCGAGTT-3')(SEQ ID NO:26) which contains an h12 
homology arm (underlined, 45bp) chosen to match a sequence in the 
inter-operon region located at 81.2 min of the E. coli chromosome and a 
priming sequence (20 bp) (Figure 13). The underlined sequences illustrate 
each respective homology arm, while the remainder is the priming 
sequences for hybridization to complementary nucleotide sequences on 
the template DNA for the PCR reaction. The two resultant PCR fragments 
were the fused kanamycin selectable marker- P T5 promoter containing the 
homology arms (h10 and hi 1) and the P. stewartii crtE gene containing 
the homology arms (h8 and h12) as illustrated in Figure 13. 

The PCR reaction, purification, and electro-transformation were 
performed as described in Example 2 except that the transformation of the 
reporter plasmid pPCB15 into E. coli strain was omitted. As noted 
previously, plasmid pPCB15 contains the Pantoea stewartii (ATCC 8199) 
crtEXYIB gene cluster. In order to measure for functional expression of 
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any of the genes chromosomally engineered into E. coli as described in 
Examples 12 and 13, it was necessary to omit the reporter plasmid. Both 
fused kanamycin marker- P T5 promoter PCR products (1-5 jag) and the P. 
stewartii crtE PCR products (1-5 \xg) were co-transformed into an E. coli 
5 host strain (MC1061) expressing the A,-Red recombinase system by 
electroporation as previously described in Example 2 except for the 
omission of the reporter plasmid as described above. Transformants were 
selected on 25 |ag/mL kanamycin LB plates at 37 °C. After incubating the 
plate at 37 °C overnight, two kan R resistant transformants were selected. 

10 Two kan R resistant transformants were PCR analyzed with T10 (5- 

CCATGACCCTACATTGTGATCTATAG-3 , )(SEQ ID NO:27 and T13 (5'- 
GGAACCATTGAACTGGACCCTAACG-3')(SEQ ID NO:33) primer pair 
(Figure 14). PCR analysis was performed under same PCR reaction 
condition as described in Example 4. PCR testing with T10/T13 on two 

15 transformants exhibited the expected size, 2883bp, on 1% agarose gel. 
The result indicated the correct integration of the fused kanamycin 
selectable marker- P T5 promoter DNA fragment along with P. stewartii 
crtE gene into the inter-operon region located at 81 .2 min of E. coli 
chromosome, yielding E. coli Pj 5 -crtE. 

20 EXAMPLE 12 

Chromosomal Integration of the P. stewartii crtIB Genes in E. coli P j ^-crtE 
to Construct P T 5 -crtEIB - Preparation of Two Linear DNA Fragments: (1) 
Fused Kanamycin Selectable Marker- P T5 Promoter-P. stewartii crtE 
Gene and (2) P. stewartii crtIB Genes, Electro-transformation, and 

25 Elimination of the Antibiotic Selectable Marker 

The linear DNA fragment containing the fused kanamycin 
selectable marker- P T5 promoter-P. stewartii crtE gene was synthesized 
by PCR from the genomic DNA of E. coli P T $-crtE with primer pairs, T10 
(SEQ ID NO:27) which contains a priming sequence (26bp) corresponding 

30 to the 162 bases in the upstream region of the integration site of the fused 
kanamycin selectable marker- P T5 promoter-c/tE gene in E. coli and 
B1 (crtIB) 

(S^- TCCTCCAGCATTAAGCCTGCCGTCGCCTT TTAACTGACGGCAGCG 
AGTTTTTTGTC-3')(SEQ ID NO:28) which contains an h14 homology arm 
35 (underlined, 29bp) chosen to match sequences in the downstream region 
of the crtl start codon and a priming sequence (27bp) (Figure 13). The 
linear DNA fragment containing P. stewartii crtIB gene was synthesized by 
PCR from pPCB15 (Figure 5) with primer pairs, T2(crtlB) (5- 



TTTGACAAAAAACTCGCTGCCGTCAGTTAAA AGGCGACGGCAGGCTT 
AATGCTG-3')(SEQ ID NO:29) which contains an h14 homology arm 
(underlined, 30bp) chosen to match a sequence in the 3'-end region of the 
fused kanamycin selectable marker- P T5 promoter-c/fE gene and a 
5 priming sequence (24 bp) and B2(crtlB) (5'- 

AGAATGACCAGCTGGATGCATTATCTTTATTTGGATCATTGAGGG CTA 
GATCGGGCGCTGCCAGA-3')(SEQ ID NO:30) which contains an h12 
homology arm (underlined, 45bp) chosen to match a sequence in the 
inter-operon region located at 81.2 min of the E. coli chromosome and a 

10 priming sequence (20 bp) (Figures 5 and 13). The underlined sequences 
illustrate each respective homology arm, while the remainder is the 
priming sequences for hybridization to complementary nucleotide 
sequences on the template DNA for the PCR reaction. The two resultant 
PCR fragments were the fused kanamycin selectable marker- P T5 

15 promoter-P. stewartii crtE gene containing the homology region (162 bp) 
at the 5'-end and homology arm (h13), and the P. stewartii crtIB genes 
containing the homology arms (h14 and h12) as illustrated in Figure 13. 

The PCR reaction, purification and electro-transformation were 
performed as described in Example 2 except for omitting the step of 

20 transforming the host cell with the reporter plasmid, pPCB15, as described 
in Example 1 1 . Both the fused kanamycin selectable marker- P T5 
promoter-P. stewartii crtE gene PCR products (1-5 |ug) and the P. stewartii 
crtIB PCR products (1-5 |ng) were co-transformed into an E. coli host cell 
expressing the X-Red recombinase system by electroporation as 

25 previously described in Example 2. Transformants were selected on 
25 ^ig/mL kanamycin LB plates at 37 °C. After incubating the plate at 
37 °C overnight, one kan R resistant transformant was selected. The 
kanamycin selectable marker was eliminated according to the method as 
described in Example 3. 

30 EXAMPLE 13 

Confirmation of Chromosomal Integration of the P. stewartii crtE and crtIB 

Genes in E. coli P T *-crtEIB 
The selected kan R resistant transformant was PCR analyzed with 
different combinations of specific primer pairs, T10 and T2, T2(T5) and 

35 T12 (5 , -CTAGATCGGGCGCTGCCAGAGATGA-3')(SEQ ID NO:32), 
T11(5 , -ACACGTTCACCTTACTGGCATTTCG-3')(SEQ ID NO:31) and 
T13, and T10 and T13 (Figure 14). Test primers were chosen to amplify 
sequences located either in the vicinity of the integration region of the 
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kanamycin selectable marker- P T5 promoter-crfE fragment or the crtIB 
genes (Figure 14). PCR analysis was performed under same PCR 
reaction condition as described in Example 4. PCR test with T10 and T2, 
T2(T5 and T12, T1 1 and T13, and T10 and T13 exhibited the expected 
5 sizes, 676 bp, 3472 bp, 3478 bp and 5288 bp on 1% agarose gel, 
respectively. The elimination of the kanamycin selectable marker was 
confirmed by PCR fragment analysis (Figure 14). PCR fragment analysis 
with primer pair T10 and T2 exhibited no product formation as expected. 
PCR analysis with primer pairs T2(T5) and T12, T11 and T13, and T10 

10 and T13 exhibited the expected PCR product sizes of 3472 bp, 3478 bp, 
and 3895 bp on 1% agarose gel, respectively. The results indicated the 
correct integration of the fused kanamycin selectable marker- P T5 
promoter-P. siewartii crtE gene DNA fragment and P. stewartii crtIB genes 
into the inter-operon region located at 81.2 min of E. coli chromosome, 

15 yielding E. coli P T5 -crtEIB. 

The functional expression of the enzymes geranylgeranyl 
pyrophosphate synthase (CrtE), phytoene synthase (CrtB) and phytoene 
dehydrogenase (Crtl) in E. coli P T5 -crtEIB was confirmed by the synthesis 
of lycopene based on the production of pink pigment. After extracting 

20 lycopene with acetone as described in Example 5, lycopene production by 
E. coli Pj 5 'CrtEIB was confirmed spectrophotometrically by observing 
X max peaks at 444, 470, and 502 nm, values characteristic of lycopene. 
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